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Section  I 
INTRODUCTION 

A central  topic  of  modern  system  theory  Is  parameter  sensitivity. 

The  first  step  in  the  analysis  of  a system  is  to  obtain  a mathematical 
model  of  that  system.  Such  a mathematical  model  invariably  contains 
parameters  (fixed  constants  such  as  the  value  of  a resistor  in  an  elec- 
trical network)  which  are  known  with  varying  degrees  of  certainty.  Para- 
meter sensitivity,  then,  is  the  study  of  any  property  of  this  mathematical 
model  which  might  be  altered  by  a change  of  these  parameter  values  from 
their  nominal  or  assumed  values. 

To  limit  the  scope  of  the  discussion,  the  parameters  are  assianed  to  be 
constant  and  only  local  variations  of  the  parameters  from  their  nominal 
values  will  be  treated.  Also,  the  main  concern  will  be  with  variations  in 
the  mathematical  model  output  (or  system  response)  caused  by  these  local 
pertui-bations  of  the  parameters.  To  be  specific,  if  it  is  not  stated 
otherwise,  then  "parameter  sensitivity"  will  be  defined  as  the  Frechet 
derivative  of  the  mathematical  model  output  with  respect  to  the  unknown 
constant  parameters  evaluated  at  the  nominal  parameter  values.  This  has 
become  a fairly  standard  definition  of  parameter  sensitivity.  (See,  e.g., 
Tomovic  and  Vukobratovic  (Ref  29)). 

Notice  that  a derivative  (or  partial  derivative)  is  a linear  operator 
lag.,  (Ref  3:  l62))  and  hence  the  parameter  sensitivities  will  also  be 

referred  to  as  "sensitivity  operators".  Indeed,  this  is  the  central  theme 
of  this  presentation:  the  parameter  sensitivities  are  linear  operators 
and  by  treating  them  as  such  many  of  their  system  properties  become  more 
apparent  and  easier  to  deal  with.  By  further  limiting  the  discussion  to 
linear  mathematical  models  of  systems,  the  system  output  la  defined  by 
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the  s\im  of  linear  operators  on  the  initial  conditions  and  the  control 
input.  The  "sensitivity  operators"  are  then  defined  in  terms  of  the  Frechet 
derivatives  of  these  system  operators.  This  operator  structure  is  used 
as  a unifying  foundation  for  the  sensitivity-related  system  properties 
which  are  presented. 

Often  the  first  step  in  the  analysis  of  a system  is  to  identify  or 
estimate  the  system  parameters  from  experimental  meastirements , and  para- 
meter sensitivity  is  the  basis  of  many  numerical  techniques  for»parameter 
identification.  The  sensitivity  operators  are  intimately  related  to  the 
question  of  "identif lability";  that  is,  the  a priori  capability  to  deter- 
mine whether  or  not  a set  of  parameters  may  be  xmiquely  estimated  from  a 
given  set  of  experimental  measurements.  Also,  parameter  identification 
capability  may  be  optimized  by  proper  design  of  the  sensitivity  operators 
through  either  selection  of  the  initial  conditions  or  control  input.  The 
sensitivity  operators  jire  analogous  to  signal  strength  in  a communication 
channel,  and  it  will  generally  Improve  estimation  capability  if  the  sensi- 
tivities are  "large". 

Once  a system  model  with  fairly  acctirate  nominal  parameter  values  is 
obtained,  usually  the  next  step  in  system  analysis  is  to  design  a control 
function  to  achieve  some  objective  or  optimize  some  performance  criterion. 
Here  one  generally  wishes  the  system  to  be  forgiving  of  one's  Inaccurate 
knowledge  of  the  true  parameter  values,  and  so,  in  this  case",  one  would 
like  the  sensitivity  operators  to  be  small.  Associated  with  this  objective 
we  discuss  the  sensitivity-related  system  properties  of  insensitivity, 
sensitivity  operator  controllability,  and  optimal  design  of  a control  in- 
put to  minimize  the  value  of  the  sensitivity  operators. 

The  sensitivity-related  system  properties  mentioned  above  are  con- 
sidered in  this  dissertation,  but  they  are  examples  of  Just  a few  of  the 
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many  important  ways  which  parameter  sensitivity  enters  into  system  theory. 


Each  of  these  properties  is  ideally  treated  via  an  operator  viewpoint 
and  many  useful  digital  computational  techniques  are  thereby  obtained. 
Finally,  we  note  that  an  operator  structure  is  equeilly  applicable  in 
either  the  time  domain  or  frequency  domain,  but  to  limit  the  scope  of  the 
presentation  we  will  only  treat  the  time  domain  description  of  linear 
dynamic  systems. 


1.  BACKGROUND  LITERATURE 

The  earliest  interests  in  parameter  sensitivity  stemmed  mainly  from 
the  effect  which  parameter  variations  would  have  upon  control  system  per- 
formance. Bode  (Ref  h)  was  one  of  the  first  investigators  to  consider 
parameter  sensitivity  in  dynamic  system  analysis.  Broadly,  he  defined  the 
sensitivity  function  as  the  ratio  between  a given  percentage  change  in  a 
parameter  component  and  the  resulting  percentage  change  in  the  system  out- 
put (delivered  voltage  in  an  electrical  network).  Through  frequency  domain 
analysis  he  showed  that  feedback  could  significantly  reduce  the  "sensitivity 
function"  in  an  electrical  network.  This  original  work  spurred  a great  deal 
of  research  into  a frequency  domain  analysis  of  parameter  sensitivity  and 
an  analysis  of  the  effect  of  feedback  on  reducing  sensitivity  in  automatic 
control  systems.  (See,  e.g.,  Horowitz  (Ref  lit)  and  Horowitz  and  Shaked  (Ref  ^5)! 

Although  the  operator  approach  to  parameter  sensitivity  is  closely 
related  to  a frequency  domain  treatment,  this  specific  analysis  treats 
linear  dynamic  systems  in  a time  domain  state-variable  description  (Kalman 
(Ref  l6)).  Toraovlc  (Ref  28)  develops  a time-domain  description  of  para- 
meter sensitivity  for  linear  and  nonlinear  dynamic  systems  and  Introduces 
the  concept  of  the  "sensitivity  system";  that  is,  if  a system  is  described 
by  the  nonlinear  differential  equation 
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x(t;  b)  = f(x,  b,  t)  = d(b)  t e [t^,  t^]  (l) 

where  b Is  an  unknown  constant  parameter  with  nominal  value  b , then  the 

o 

linearized  differential  equation 

it  « ‘o’ ' -o'  * p S'  • '>.  '=> 

1 h a b^  |b  a b^ 


is  referred  to  as  the  "sensitivity  system" , and  for  t ^ t^ , 

?(t;  b ) = ^ (t;  b) 

° b = b 

o 


(3) 


is  termed  the  "sensitivity  function"  or  "parameter  influence  function". 

This  fundamental  differential  equation  description  has  been  the  basis 
for  nearly  all  of  the  time-domain  analyses  of  the  parameter  sensitivities. 
(See,e.g.,  (Ref  29)  (Ref  6)  as  general  references  on  sensitivity).  For 
general  nonlinear  systems  this  is  perhaps  the  most  appropriate  way  of 
dealing  with  parameter  sensitivity,  and  very  few  general  results  beyond 
this  basic  relationship)  have  been  obtained.  However,  for  linear  systems, 
and  in  particular  for  linear  time-invariant  ordinary  differential  equa- 
tion systems , there  has  been  a tremendous  amount  of  research  into  the 
structure  and  system  properties  of  the  "sensitivity  system".  Our  specific 
applications  focus  on  such  linear  time-invariant  ordinary  differential 
equation  systems,  and  so  we  discuss  the  relevant  literature  of  parameter 
sensitivity  for  such  systems.  However,  before  doing  so  it  is  Important  to 
comment  that  the  fundamental  difference  between  the  research  reported  herein 
and  that  of  previous  investigators  is  the  fact  that  we  do  not  use  a "sensi- 
tivity system"  differential  equation  representation  of  the  parameter 
sensitivities;  rather  we  use  an  operator  time-domain  description  of  the 
linear  system.  To  obtain  the  parameter  sensitivities  we  then  directly 
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differentiate  this  operator  description.  Therefore,  although  the  previous 
literature  on  sensitivity  is  quite  pertinent  in  that  many  of  the  results 
obtained  are  similar  or  identical  to  those  reported  here,  the  fundamental 
approach  taken  in  deriving  these  results  is  generally  quite  different, 
a.  Structural  Properties 
Consider  the  system 

x(t;u)  =•  A(b)x(t;u)  + B(b)u(t)  x(0)  ••  d(b)  telO,t^]  (4) 

where  x(t)  is  an  n-dlmensional  state  vector,  u(t)  an  r-dlmenslonal  control 
input,  and  b a p-dlmenslonal  real  parameter  vector  which  parameterizes  the 
matrices  A and  B and  the  initial  condition  vector  d.  It  is  asbumed  that 
these  latter  quantities  are  continuously  differentiable  with  respect  to 
the  individual  parimeter  components  b^  at  a nominal  value  of  the  parameter 
vector  b_  e r'®’.  Then  the  state  sensitivities  (g^^^(t;u)  = 


computed  from  the  so-called  "sensitivity  system"  (Ref  29) 


X(t;u)  ■ AX(t;u)  + Bu(t)  X(0)  » d 


where  all  the  quantities  are  evaluated  at  b^  c R^  and  are  defined  by 


x(t;u) 

C^^^(t;u) 

X(t;u)  i 


n(p+l)xl 


n(p+l)xl 
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The  subscript  vithln  parenthesis  indicates  partial  differentiation  with 
respect  to  the  parameter  component.  Therefore,  by  a direct  analysis  the 
complete  solution  for  all  of  the  parameter  sensltvities  would  require 
solving  n(p  + l)  coupled  linear  differential  equations;  if  n and  p are 
both  large  this  can  be  quite  a large  number  of  differential  equations 
indeed. 

However,  the  sti^icture  of  this  "sensitivity  system"  is  quite  unique 
and  many  interesting  facts  have  been  discovered  about  its  properties.  A 
nvimber  of  researchers  have  investigated  canonical  form  representations 
and  interrelationships  which  significantly  reduce  the  required  number 
of  differential  equations  for  producing  the  sensitivity  system.  (Befs 
3^-^2)  Much  of  this  work  stems  from  the  "sensitivity-points"  techniques 
Introduced  by  Kokotovlc  (Ref  37).  Varshney  (Ref  Uo)  provides  an  excellent"' 
summary  of  these  techniques,  and  shows  that  the  required  number  of 
differential  equations  for  generating  the  first  order  sensitivity  system 
may  be  reduced  from  n(p  + l)  to  n(r  + 3)  where  r is  the  control  dimension. 
He  also  considers  extensions  of  these  techniques  to  obtain  minimal  order 
models  of  the  second  order  sensitivity  operators  and  the  first  order 
sensitivities  for  linear  time-invariant  time-delay  systems. 
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More  recently  the  controllability  and  structured  properties  of  the 
sensitivity  system  have  been  an  area  of  considerable  Interest.  Holtzman 
and  Horing  (Ref  69)  examine  optimal  control  methods  for  making  the 
sensitivities  zero  at  the  terminal  time.  This  work  has  initiated  an 
interest  in  sufficient  conditions  for  uncontrollability  of  the  sensitivty 
system  (Refs  U0-53).  Through  a Jordan  canonical  form  analysis  of  the 
sensitivity  system  plant  matrix  A,  Guardabassi,  et  al,  (Ref  53»)  (Ref  52) 
investigate  the  structural  properties  and  the  controllability  of  the 
sensitivity  system.  In  reference  51  they  examine  the  minimal  polynomial  of 

A and  obtain  a general  formula  for  eigenvalue  sensitivity  based  upon 

• ■ 

the  Jordan  canonical  form  of  A.  They  conclude  that  the  eigenvalue 

multiplicities  of  A are  no  more  than  twice  that  of  the  original  A matrix. 

They  also  examine  uncontrollability  of  the  sensitivity  system  and  obtain 

a composite  of  sufficiency  conditions  for  uncontrollability.  In  reference  52 

they  give  further  consideration  to  the  Jordan  ceuionlcal  form  of  A for 

the  special  case  in  which  A has  no  repeated  eigenvalues.  They  show 

that  the  structiire  of  the  Jordan  canonical  form  of  A is  Intimately 

related  to  the  eigenvalue  sensitivities  and  that  sufficiency  conditions 

for  uncontrollability  are  also  related  to  the  eigenvalue  sensitivities. 

A general  statement  of  their  results  is  that  the  sensitivity  system  is 

always  uncontrollable  if  the  number  of  parameters  is  greater  than  the 

nvmber  of  control  inputs.  Finally,  for  the  special  case  of  single 

input  systems,  they  obtain  necessary  and  sufficient  conditions  for 

controllability  of  the  sensitivity  system.  These  conditions  are  Inde- 

P 

pendent  of  the  nominal  parameter  value,  b^  e R , and,  again  they  are 
dependent  upon  the  eigenvalue  sensitivities. 

Gupta  and  Mehra  (Ref  84  ) and  Gupta  and  Hall  (Ref  108)  consider 


controllability  of  the  sensitivity  system  by  a direct  analysis  of  con- 
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trollability  of  the  matrix  pair  (A,  B).  By  analyzing  linear  dependence 
of  vectors  in  the  controllability  matrix  of  (A,  B)  they  obtain  sufficiency 
conditions  for  uncontrollability.  They  use  the  linear  dependency  of 
column  vectors  in  the  controllability  matrix  of  (A,  B)  to  provide  a con- 
structive means  to  reduce  the  number  of  different!  tO.  equations  to  the 
minimum  number,  and  remove  some  of  the  undesirable  computational 
featxires  associated  vith  reduction  of  the  sensitivity  system  via  canonical 
form  analysis  (Ref  Uo) 

An  area  which  has  also  used  the  structural  relationships  of  the 
sensitivity  system  for  analysis  is  the  question  of  "insensitivity".  (Refs 
5^-67)  Various  definitions  of  insensitivity  have  been  utilized,  and 
Guardabassi  et  al,  (Ref  61)  provide  an  excellent  summary  of  the  various 
definitions  and  conditions  for  insensitivity.  Also  they  show  the  rela- 
tionships between  parametric  insensitivity  and  the  related  concepts  of 
signal  invariance  (e.g‘.,  (Ref  57)),  signal  insensitivity  (e.g.,  (Ref  6U)), 
and  parametric  Invariance  (Ref  5^)).  All  of  these  concepts  are  con- 
cerned with  system  conditions  which  ensure  that  parametric  or  signal 
perturbations  cause  no  change  in  the  system  output.  Insensitivity 
generally  connotes  zero  effect  for  local  or  small  changes  of  the  parameters, 
whereas  the  property  of  invariance  is  generally  concerned  with  the  stronger 
condition  of  zero  output  perturbation  for  larger  changes  of_  the  parameters . 
We  will  later  be  concerned  with  the  weaker  condition  of  parametric 
insensitivity,  but  some  recent  work  of  Guardabassi,  et  al  (Ref  62)  shows 
that  insensitivity  of  a suitably  defined  associative  system  can  often 
imply  invariance  of  the  originaD.  system.  The  conditions  for  Insensitivity 
are  well-known  (e.g., (Ref  61)),  and  recent  research  has  concentrated  on 
the  geometric  construction  of  feedback  control  laws  which  achieve  insensi- 
tivity (eg.,  (Ref  55)  (Bef  58)  (Hcf  59)  (Ref  65),.  We  do  not  delve  into  this 
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an  interesting  alternative  to  the  nonlinear  matrix  Riccati  differential 
equation  method  of  solution. 

Closed-loop  or  feedback  control  laws  cure  known  often  to  have  smaller 
sensitivity  than  the  nominally  equivalent  open-loop  control  law  (e^, 

Horowitz  (Ref  1**),  Cruz  and  Perkins  (Ref  7)).  Therefore,  there  has  also 
been  a considerable  amount  of  research  into  the  design  of  feedback  control 
laws  which  minimize  parameter  sensitivity  by  using  both  classical  frequency 
domain  and  optimal  control  techniques.  For  the  linear  time-invariant  system 
(k)  the  Riccati  equation  solution  does  not  produce  an  optimeJ.  closed-loop 
minimum  sensitivity  control  law  because  the  optimal  control  is  based 
upon  the  nominal  values  of  the  peurameter  sensitivities,  which  are  com- 
puted under  the  assumption  that  the  control  is  open-loop.  However, 
there  has  been  quite  a number  of  attempts  to  design  suboptimal  minimum 
sensitivity  feedback  control  laws  based  upon  this  Riccati  equations 

structure.  (See,e.g,  Lament  and  Kahne  (Ref  72)).  Also,  there  is  con- 

0 

siderable  interest  in  the  design  of  minimum  sensitivity  control  laws  •vdien 
there  is  noise  entering  into  the  system  as  well  as  parameter  variations 
(e.g.,  (Ref  77))  or  when  the  control  law  is  adaptively  adjusted  by  using 
on-line  estimates  of  the  parameter  values  (e.g.  Ref  7**)).  However,  to 
limit  the  scope  of  our  presentation  we  only  consider  the  open-loop  problem 
and  do  not  discuss  the  closed-loop  adaptive  control  problems. 

c.  Peu-ameter  Identification  and  Smsltlvlty 

The  problem  of  estimating  the  parameters  in  a dynamic  system  can  be 
formulated  as  an  optimization  problem  in  which  the  parameter  values  are 
selected  to  either  minimize  or  maximize  some  criterion.  Most  often  this 
criterion  involves  the  difference  between  the  measured  system  output  and 
the  model  system  output,  where  the  model  output  is  dependent  upon  the 
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One  of  the  important  questions  of  parameter  identification  is  whether 
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or  not  the  parameters  in  a system  model  can- be  uniquely  determined 
from  a given  set  of  experimental  observations.  This  question  is  the 
essence  of  what  is  called  "identifiabllity"  (Lee  (Ref  18)).  If  unique 
estimates  of  the  parameters  cannot  be  obtained  (the  system  is  not  identi- 
fiable), then  either  the  mathematical  model  or  the  experiment  itself  must 
be  modified.  Because  it  is  such  a basic  question  and  because  the  research 
on  parameter  identification  is  so  extensive,  the  literature  on  Identifi- 
ability  is  equally  diversified. 

Tse  and  Anton  (Ref  103),  Aoki  and  Yue  (Ref  88),  Staley  and  Yue 
(Ref  101 ),  Tse  (Ref  102)  and  other  have  used  a stochastic  definition  of 
identifiabllity  in  which  identifiabllity  is  defined  by  whether  or  not 
there  exists  an  estimator  which  gives  convergence  in  probability  (or  in 
mean  square)  to  the  true  parameter  value.  The  concept  of  stochastic 
identifiabllity  is  then  stronger  than  the  deterministic  view  of  identifi- 


ability  discussed  in  the  above  paragraph  (vinlqueness  of  solution),  and 
deterministic  identifiabllity  is  necessary  but  not  sufficient  for 
stochastic  identifiabllity  when  there  is  system  measurement  and/or  system 
plant  noise  (Tse  and  Anton  (Ref  103)).  For  linear  time-invariant  systems 
with  Gaussian  white  measurement  noise,  the  conditions  of  stochastic 
identifiabllity  lead  to  the  conclusion  that  the  local  information  matrix 
must  be  positive  definite  to  ensure  local  identifiabllity  (Tse  (Ref  102)). 

On  the  other  hand,  many  researchers  ^.g.,  Lee  (Ref  Id)),  Bellman  and 
Xstrom  (Ref  89),  Glover  (Ref  91)  (Ref  92),  Glover  and  Willems  (Ref  93), 
and  Martenson  (Ref  96))  have  adopted  a deterministic  approach  to  Identifi- 
abllity similar  to  that  discussed  in  the  first  paragraph  above  and  this 
is,  in  fact,  the  approach  which  we  will  take.  In  addition  to  conditions 

equivalent  to  the  Information  matrix  being  positive  definite,  these 
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deterministic  approaches  have  provided  considerable  structural  information 
concerning  identifiability  in  linear  time-invariant  ordinary  differential 

o 

equation  systems.  Bellman  and  Astrom  (Ref  89),  Mehra  (Ref  98),  Mayne 
(Ref  97),  Denham  (Ref  90),  and  others  examine  conditions  under  which 
certain  canonical  forms  will  be  identifiable  regardless  of  the  nominal 
parameter  values.  Mehra  (Ref  98)  gives  an  excellent  sunnnary  of  these 
conditions . 

However,  if  the  parameters  of  the  mathematical  model  are  physical 
variables  it  may  be  inconvenient  to  utilize  such  canonical  form  repre- 
sentations for  identification.  Additionally,  Glover  and  Willems  (Ref  93) 
point  out  some  inconsistencies  which  can  arise  in  using  such  canonical 
forms.  Therefore,  using  arbitrary  parnnieterizations  they  consider  condi- 
tions for  both  local  and  global  identifiability  from  the  system  frequency 
domain  transfer  function  (zero  initial  conditions).  Then  in  reference  92 
Glover  extends  the  worK  of  Popov  (Ref  100)  to  consider  general  structural 
conditions  for  minimal  and  nonminal  parameterization  of  linear  time-invariant 
systems.  He  relates  these  conditions  to  structural  identifiability  in  such 
systems,  and  shows  that  the  parameter  identification  problem  may  become  ill- 
posed  if  the  parameterization  is  nonminimal. 

Martenson  (Ref  96)  also  uses  a deterministic  approach  to  identifiability 
By  extending  some  concepts  of  Silverman  and  Meadows  (Ref  25-)  concerning 
sufficient  conditions  for  observability  in  linear  time-varying  systems , 
he  obtains  general  algebraic  sufficiency  conditions  for  local  identifi- 


ability in  both  linear  and  nonlinear  dynamic  systems. 

Finally,  in  the  next  section  we  will  discuss  the  fact  that  the 
control  input  strongly  affects  the  quality  of  parameter  identification 
capability,  and,  indeed,  the  control  input  may  affect  whether  or  not 
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the  system  parameters  are  even  identifiable.  Astrom  and  lykhoff  (Ref  80) 
term  such  conditions  on  the  control  input  as  conditions  for  "persistent 
excitation".  Through  frequency  domain  analysis  (steady-state  conditions) 
Mehra  (Ref  98),  Payne  and  Goodwin  (Ref  99)»  Hoberock  and  Stewart  (Ref  9^) 
and  others  have  examined  this  question  in  considerable  detail  and  have 
determined  the  nianber  of  frequency  components  which  are  necessary  to 
ensure  Identifiability  in  steady  state  operation.  This  interaation 
between  identification  and  the  control  input  is  extremely  Important.  Later 
we  examine  identifiability  of  the  zero-state  response  in  linear  time- 
invariant  systems,  and  general  structural  conditions  for  identifiability 
are  obtained.  However,  these  structiural  conditions  are  only  necessary,  as 
sufficiency  is  dependent  upon  control  input  conditions  for  "persistent 
excitation". 

e.  Sensitivity  Design  for  Optimal  Parameter  Identification 

The  relationship  between  the  control  input  function  and  parameter  esti- 
mation capability  has  long  been  recognized.  A considerable  number  of 
papers  have  appeared  on  various  aspects  of  designing  control  Inputs  to 
optimize  or  improve  parameter  estimation  capability,  and  these  techniques 
have  been  successfully  applied  to  many  practical  problems.  fe.g.,  (Refs  lOU  - 
122))  Mehra  (Ref  llU)  gives  an  excellent  summary  of  the  literature  on 
input  design,  discussing  methods  of  solution  In  both  the  frequency  domain 
and  the  time  domain.  Rather  than  repeat  this  general  discussion,  we 
will  discuss  only  the  work  which  is  directly  relevant  to  our  approach. 

Our  approach  is  entirely  motivated  by  the  novel  work  of  Mehra  (Ref  112). 
He  shows  that  the  optimal  input  function  to  maximize  a weighted  trace  of 
the  information  matrix  for  linear  time-invariant  ordinary  differential 
equation  systems  is  the  eigenfunction  corresponding  to  the  largest  eigen- 
value of  a positive  self-adjoint  operator.  He  suggests  various  methods 
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of  solution  including  the  Rayleigh-Ritz-Galerkin  method.  This  is  the 
solution  method  we  later  adopt  for  our  operator  approach,  hut  Mehra  concen- 
trates on  a method  which  transforms  the  problem  into  a two-point  boundary 
value  problem  using  the  Pontryagin  Maximum  Principle  (Ref  22).  Using  the 
sensitivity  system  differential  equations  he  suggests  solving  this  problem 
by  means  of  a matrix  Riccati  differential  equation  technique.  In  (Ref  120) 
a computer  algorithm  using  this  Riccati  equation  approach  is  outlined,  and 
the  solution  method  is  applied  to  the  determination  of  optimal^  input  func- 
tions for  the  identification  of  aircraft  stability  and  control  derivatives. 

This  solution  method  is  then  further  refined  by  Gupta  and  Hall  (Ref  108) 
who  eliminate  much  of  the  vindesirable  features  of  numerical  integration  of 
the  nonlinear  matrix  Riccati  differential  equation.  They  also  incorporate 
the  controllability  properties  of  the  sensitivity  system  to  reduce  the 
amount  of  computations  still  further.  The  computational  aspects  of  this 
solution  method  will  be  discussed  further  in  Section  VII  and  compared 
with  the  new  solution  technique  which  is  developed  using  the  operator 
description  of  the  parameter  sensitivities  and  the  Rayleigh-Rltz-GeQerkin 
method. 

We  comment  that  a recognized  shortcoming  of  the  original  approach  of 
Mehra  (Ref  112)  is  the  use  of  the  weighted  trace  of  the  information  matrix 
as  the  optimization  criterion.  Unless  the  weighting  matrix  is  selected 


Judiciously,  this  optimization  criterion  can  lead  to  erroneous  results  and 
even  to  conditions  in  which  the  local  information  will  be  singular  (the 
system  will  be  nonldentlflable) . (See,e.g.,  Reid  (Ref  119),  Mehra  (Ref  ll^*), 
Zarrop  and  Goodwin  (Ref  122)).  Others  have  suggested  that  either  the  trace 
of  the  inverse  inf  rmatlon  matrix  (e.8.,  Reid  (Ref  119),  Gupta  and  Hall 
(Ref  108),  and  Goodwin,  et  al  (Ref  107))  or  the  determinant  of  the  inverse 
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Information  matrix  (e.g.,  Mehra  (Ref  lll»)  and  Nahi  and  Napjus  (Ref  117))  is  a 
better  criterion  for  optimization.  Through  the  Cramer-Rac  lower  bound 
the  former  criterion  is  directly  related  to  the  variai’ce  of  the  minimxim 

! 

j estimation  error  whereas  the  latter  is  directly  proportional  to  the  volume 

I 

; of  the  error  ellipsoid  of  parameter  estimation  error.  (See  Nahi  and  Napjus 

(Ref  117)  for  further  detailed  discussion  of  various  optimization  criteria 
useful  in  the  input  design  problem) . 

: However,  through  some  results  recently  obtained  by  Mehra  (Ref  113), 

both  Mehra  (Ref  llU)  and  Gupta  and  Hall  (Ref  108)  derive  Iterative  algorithms 
to  successively  adjust  the  weighting  matrix  in  the  information  matrix 
criterion  so  that  the  resulting  optimeil  input  minimizes  either  the  trace 


or  the  determinant  of  the  inverse  of  the  information  matrix.  The  original 


resvilts  of  Mehra  (Ref  112)  are,  therefore,  an  integral  part  of  these 
iterative  algorithms.  Indeed,  the  maximization  of  the  weighted  trace  of 

I 

the  information  matrix  is  the  most  time  consuming  part  of  these  algorithms. 
Therefore,  although  our  approach  is  closely  related  to  reference  112  in 
using  the  weighted  trace  of  the  information  matrix,  it  has  application  to 
other  optimization  criteria  through  these  more  recent  results. 

2.  OBJECTIViiS  AND  ORGANIZATIONS 

There  are  two  major  objectives  in  this  research  and  the  presentation 
of  this  report.  The  first  is  to  demonstrate  a general  operator  framework 
for  treating  parameter  sensitivity  in  linear  dynamic  systems  and  to  show 
the  usefulness  of  this  operator  formulation  for  analyzing  various  sensitivity 
related  system  properties.  The  second  objective  is  to  utilize  this  operator 
formulation  to  obtain  algorithms  and  computational  techniques  which  will 
be  useful  to  systems  engineer^  dealing  in  modeling,  identification  and 
control.  In  particular,  oxir  specific  computational  results  mainly  emphasize 
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large  scale,  linear,  multivariable,  time- invariant  ordinary  differential 
equation  systems. 

To  help  achieve  the  first  objective,  concepts  are  initially  developed 
in  a very  general  operator  framework,  and  then  these  basic  concepts  are 
illustrated  by  specific  application  to  linear  ordinary  differential  equation 
systems  with  emphasis  on  the  time-invariant  system,  llierefore,  faction  IT 
defines  parameter  sensitivities  for  a linear  system  described  in  terms  of 
the  s\jm  of  bounded  linear  operators  on  the  Hilbert  space  of  initial  condi- 
tions and  the  Hilbert  space  of  control  inputs.  Once  defining  these 
sensitivities  in  terms  of  the  Frechet  partial  derivatives  of  the  system 
operators,  we  discuss,  in  turn,  the  sensitivity-related  system  properties 
on  insensitivity,  sensitivity  controllability,  open-loop  minimum  sensitivity 
control  design,  parameter  identification,  identifiability,  and  sensitivity 
design  to  optimize  parameter  estimation  capability.  These  system  properties 
are  chosen  for  discussion  not  only  because  of  their  system  importance,  but 
also  because  of  their  direct  dependence  upon  the  operator  description  of 
the  parameter  sensitivities. 

Section  II  establishes  the  fundamental  operator  methodology  and  many 
of  the  basic  definitions  which  are  subsequently  used  to  consider  parameter 
sensitivity  in  linear  ordinary  differential  equation  systems  (Section  III  - 
VII).  This  general  operator  presentation  allows  an  appealing  compactness 
of  notation  in  the  development  concepts,  and  it  is  wi’itten  under  the 
assumption  that  the  reader  has  a basic  familiarity  wltn  normed  linear 
spaces  and  some  of  the  elementary  concepts  of  functional  analysis  and 
linear  operator  theory.  Luenberger  (Ref  20)  is  an  excellent  background 
reference  for  nearly  all  of  the  material  which  is  discussed.  However,  the 
reader  who  lacks  this  mathematical  background  should  not  be  discouraged 

from  reviewing  Section  II,  ns  this  section  contains  much  of  the  physical 
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and  geometric  motivation  for  the  properties  with  which  we  deal.  Bather, 
such  a reader  is  encouraged  to  skim  the  mathematics  lightly  while  paying 
closer  attention  to  the  general  discussion  and  definitions. 

Section  III  develops  a general  operator  description  of  the  parameter 
sensitivities  for  linear  ordinary  differential  equation  systems.  In  such 
systems  the  key  to  making  the  operator  formulation  a useable  approach  lies 
in  taking  the  partial  derivative  of  the  state  transition  matrix  with 
respect  to  the  parameter  components.  General  results  are  obtal,ned  for 
time-varying  plant  matrices.  Then  for  time-invariant  plant  matrices  a new 
algebraic  description  of  these  partial  derivatives  is  derived.  This 
description  gives  considerable  structural  insight  and  produces  practical 
computational  alternatives  to  conventional  "sensitivity  system"  differential 
j equation  approaches. 

I 

I In  Section  IV  the  geometric  aspects  of  this  representation  are  pursued 

[ by  considering  the  properties  of  insensitivity,  sensitivity  controllability, 

J and  parameter  Identlfiability.  The  operator  approach  of  Section  III  is 

utilized,  and  new  general  algebraic  conditions  are  obtained  for  local 
and  structural  identlfiability. 

The  computational  aspects  of  this  algebraic  representation  are 
next  examined  by  considering  a quasilinearization  algorithm  for  parameter 
identification  (Section  V),  optimal  open-loop  control  design  for  minimum 
sensitivity  (Section  VI),  and  sensitivity  design  for  optimaf  parameter 
identification  (Section  VII).  In  each  ease,  detailed  algorithms  are 
presented,  and  the  computational  aspects  of  these  methods  are  compared 
with  conventional  procedures  using  the  sensitivity  system  and  the  Maximum 
Principle.  To  help  illustrate  the  computational  techniques  some  simple 
examples  are  presented  within  the  respective  sections. 
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Section  II 


LINEAR  SYSTEMS  ON  A HILBERf  SPACE 


In  this  section  parameter  sensitivity  operators  and  their  system 
properties  are  considered  for  a very  general  linear  system,  S^.  This 
system  Is  defined  in  terms  of  bounded  linear  operators  on  real  Hilbert 
spaces.  For  notatlonal  convenience  and  to  simplify  the  discussion,  only 
the  system  output  and  output  sensitivity  operators  are  discussed;  how- 
ever, a similar  operator  description  of  the  system  state  and  state 
sensitivities  could  also  be  provided. 

The  intent  of  this  section  Is  to  help  motivate  the  use  of  an  oper- 
ator representation  for  the  treatment  of  parameter  sensitivity  In  general 
linear  systems.  This  discussion  provides  the  foundation  for  our  later 
analysis  of  parameter  sensitivity  In  linear  ordinary  differential  equation 
systems  (Sections  III  - VII).  However,  besides  providing  a unifying 
framework  for  this  later  discussion.  It  Is  Important  to  note  that  this 
general  treatment  should  facilitate  the  extension  of  these  sensitivity 
concepts  to  such  linear  systems  as  general  hereditary  differential  systems 
and  parabolic  partial  differential  equation  systems.  These  systems  can  be 
put  In  an  operator  - Hilbert  space  setting  (see,  eg,  Delfour  and  Mltter 
(Ref  9}  (Ref  10)  or  Baras,  et  al,  (Ref  1)),  and  such  mathematical  models 
are  beginning  to  play  an  Increasingly  Important  role  In  applications  of 
systems  theory  to  chemical,  biological,  economical,  and, other  large  scale, 
complex  systems. 

The  organization  of  this  section  Is  the  following:  In  Section  II. 1 

the  Hllbert-spacc  system  S is  defined  and  then  in  Section  II. 2 some 

K 

notatlonal  conventions  are  described.  In  Sections  II. 3 and  II. 4,  and 


\ 


\ 
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II. 5 the  control  related  properties  of  insensitivity,  sensitivity 
controllability,  and  optimal  design  for  minimum  sensitivity,  respectively, 
are  defined  and  discussed.  Finally,  in  Sections  II. 6,  II. 7 and  II. 8, 
identification  related  sensitivity  properties  are  described.  In  partic- 
ular, these  sections  discuss  general  identification  algorithms,  identlfi- 
abillty,  and  optimal  sensitivity  design  for  improved  identification, 
respectively. 

1.  SYSTEM  DEFINITION 

The  real  m-dimenslonal  mathematical  model  output  of  the  system  S^^  is 
uniquely  defined  on  the  closed  interval  [t^,  t^]  by  the  operator  expres- 
sion 


y(t;  b,  u)  = T(t;  b)d(b)  + W(t;  b)u  telt^,  t^] 


(8) 


where  d(b)  is  the  initial  state  of  the  system  from  the  Hilbert  space  D; 
u e L2(t^,  t^;  U)  is  the  control  function  input  over  the  interval  [t^,  t^] 
and  U is  the  Hilbert  space  of  controls;  T(t;  b)  c L^(D,  R™)  is  the 
continuous  zero-input  linear  operator;  W(t;  b)  e t;  U),  R**) 

is  the  casual,  continuous  zero-state  linear  operator;  and  b e is  a 
p-dlmensional  real  parameter  vector  which  parameterizes  the  initial  state 
and  both  the  zero-input  and  zero-state  system  operators.  The  components 
of  the  parameter  vector  b are  designated  b^,  1 “ 1,  2,  ...p,  and  the 
nominal  a priori  value  of  the  b e is  designated  b^  e R*^.  The  mathe- 
matical model  will  generally  be  assumed  evaluated  at  the  nominal  param- 
eter values;  therefore,  for  convenience  we  will  henceforth  delete  from 
the  notation  the  explicit  dependency  of  the  output  and  various  system 
operators  upon  b^,  and  this  dependency  will  only  be  shown  if  b j*  b^  or 
if  it  is  required  for  some  reason  of  emphasis. 
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At  the  nominal  b e and  for  each  t e [t  . t,]  It  Is  assumed  that 

O 0 1 

d,  T(t)  and  W(t)  are  continuously  Frechet  differential  with  respect  to 
each  parameter  component  1=1,  2,  ...p.  These  partial  derivatives 
are  denoted  ^^^(t),  and  W^j(t).  Since  these  partlals  are 

continuous,  the  total  Frechet  differential  Is  (Ref  3:  195) 

P 

d^^  Ab  = r d^^jAb^  (9) 

1-1 


i 

t 

i 


where  Ab  - b - b and  similarly  for  T,  (t)  and  W.  (t) . 

O D b 

The  component  sensitivity  operators  are  defined  as  the  Frechet  partial 
derivatives  of  the  system  output  with  respect  to  each  parameter  component, 
b^,  v^^^(t;  u)  = u) . Because  of  our  assumptions  these  become^ 

v^^^(t;  u)  = (T^)(t)d  + T(t)d^^j)  + W^^j(t)u  (10) 


The  total  sensitivity  operator  is  denoted  by  the  partitioned  operator 

V(t;  u)  = [v^^^(t;  u)  v^^^(t;  u)  . . . v^P^(t;  u)]^p  (11) 

so  that  the  sensitivity  "differential"  becomes 


P (1) 

V(t;  u)  Ab  - I u)  Ab^^  (12) 

1-1 

As  a final  comment,  since  we  are  dealing  with  a linear  system,  we 
are  assured  that  we  may  uniquely  decompose  the  output  of  the  system  into 
its  zero-input,  y , (t)  - T(t)d,  and  zero-state  portion, 
y (t;  u)  - W(t)u.  (Ref  33)  The  parameter  sensitivities  may  be 

Z • 8 « 


^Notice  that  this  "chain  rule"  for  the  derivative  applies  because  the 
individual  partial  derivatives  are  continuous  and  because  b^  e R. 

(Ref  3:  187-188).  If  b were  assumed  to  be  from  a general  normed  linear 

space,  then  a derivative  in  this  form  would  not  apply. 
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similarly  decomposed,  and  we  will  use  such  a decomposition  frequently 
In  the  discussion  and  the  Investigation  of  the  various  sensitivity 
related  system  properties. 

2.  MOTATIOKAL  CONVENTIONS 

We  will  often  be  Interested  in  the  system  output  and  output  sensi- 
tivities over  the  entire  time  Interval  [t  , t-].  The  space  of  output 

o I 

functions  over  this  Interval  will  be  denoted  V >•  L-(t  , t,;  R™).  An 

I o t 

element  from  this  function  space  will  be  represented  by  y(*)  oi*  y.  If 

the  time  Interval  is  restricted  to  [t  , t],  then  the  function  space  will 

o 

be  denoted  Similar  script  notations  td.ll  be  utilized  for  the  control 

function  spaces,  U L-(t  ,t-;  U)  and  U “ L-(t  , t;  U) . 

Z o ^ t z o 

In  the  discussion  of  the  sensitivity-related  properties  the  adjoint 

operator  is  frequently  encountered.  The  adjoint  of  a linear  operator 

will  be  denoted  by  that  linear  operator  with  a superscript  For 

example,  the  adjoint  of  T(t)  e L (D,  R™)  is  denoted  by  T*(t)  e L (r”',D), 

c c 

and  Is  defined  by  the  Inner  product  relation 

'[y(t)/T(t)d]  - lT*(t)y(t)/d]  (13) 

It  will  be  assumed  that  the  reader  has  a basic  familiarity  with  the 
adjoint  operator  and  Its  properties,  and  Luenberger  (Ref  20)  Is  recom- 
mended as  an  excellent  background  reference.  However,  the  following 
relationship  Is  used  so  often  that  It  Is  stated  here  for  convenience: 
LEMMA  1 

Let  A c L^(H,K)  where  H and  K are  real  Hilbert  spaces.  Then 


1) 

(R  (A))'^  - W(A*) 

(lA) 

11) 

R(A)  - (V  (A*))-^ 

(15) 
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See  Luenberger  (Ref  20;  157)  for  the  proof  of  this  lenmia.  Note 

that  R(*)  and  W(*)  denote  the  range  and  null  spaces  of  the  linear  operator, 
the  superscript  ^ Is  the  orthogonal  complement  space,  and  the  overbar 
here  denotes  closure  of  the  subspace.  Also  note  that  many  of  our  later 
manipulations  with  the  adjoint  operator  are  simplified  considerably 
because  we  have  assumed  the  underlying  spaces  to  be  real  Hilbert  spaces, 
and  such  spaces  are  dual  to  themselves. 

We  now  proceed  with  the  development  of  the  various  senslttvlty- 
related  system  properties.  In  developing  these  properties  the  following 
first-order  Taylor's  series  equation  for  the  output  function  y(b^  + Ab) 
plays  a fundamental  role: 


y(b  + Ab)  - y(b^)  + V Ab  + 


P (1) 

- y(b^)  + E v^^^  Ab^  + 
' 1-1 


where 


ita,  lU  ll/||ibll -0 
||Ab||  -^0 

It  will  generally  be  assumed  that  design  Is  accomplished  at  a nominal 
parameter  value,  b^  e R^,  but  that  the  true  parameter  value,  b e R^ 

(not  necessarily  equax  to  b^).  Is  unknown.  The  first  three  system  prop- 
erties discussed  are  parameter  insensitivity,  sensitivity  controllability, 
and  optimal  cnen-loop  control  design  for  minimum  sensitivity.  Each  of 
these  Is  ’'elated  to  making  the  sensitivity  operator,  V,  small,  and  hence 
all  have  the  general  objective  of  reducing  the  perturbation  In  system 
response 


Ay  - y(b^  + Ab)  - y(bp) 
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caused  by  small  changes  of  the  system  parameter,  Ab  ~ b - b^.  On  the 
other  hand,  the  last  three  properties  discussed  are  system  parameter 
Identification,  Ident If lability  and  sensitivity  design  for  Improved 
parameter  estimation;  here  It  will  generally  Improve  parameter  Identi- 
fication capability  If  the  sensitivity  operators  are  large  rather  than 
small.  The  Taylor's  series  relationship  Is  used  In  developing  each  of 
these  sensitivity  concepts,  and  the  operator  formulation  Is  used  as  a 
unifying  foundation  for  the  discussion. 

3.  PARAMETER  INSENSITIVITY 

Background  literature  on  parameter  insensitivity  was  discussed  in 

Section  I.l.a.  Few  specific  results  can  be  made  at  this  level  of 

generality  (for  the  Hilbert  space  system  S ),  and  so  the  objective  here 

rl 

Is  merely  to  establish  the  definitions  which  will  be  used  In  Section  IV 
for  obtaining  algebraic  insensitivity  conditions  In  linear  time-invariant 
ordinary  differential  equation  systems.  The  main  difference  between 
these  definitions  and  previous  ones  is  the  separation  between  zero-input 
and  zero-state  insensitivity^ ; 

Definition  1 

The  system  S is  b.  — component  zero-input  insensitive  if 
n X 

v^^l  (t)  ■ 0 for  all  t c [t  , t-].  It  Is  totally  zero-input  insensitive 
if  this  Is  true  for  all  1 1,  2,  ...p. 

Definition  2 

The  system  S Is  b — component  zero-state  insensitive  if 
H 1 

v^^^  (t;  u)  ••  0 for  all  t e [t  , t,]  and  all  u.  It  is  totally  zero-state 
z.  s.  or 

Insensitive  if  this  Is  true  for  all  1 1,  2,  ...p. 

'Note  that  zero-state  Insensitivity  Is  what  Guardabassl,  et  al,  (Ref  61) 
refer  to  as  "hypoinsensltivlty". 
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4.  SENSITIVITY  CONTROLLABILITY 

A central  topic  of  modern  system  theory  Is  controllability  (e.g., 
Kalman,  et  al,  (Ref  17)).  Controllability  properties  of  the  "sensitivity 
system"  in  linear  time-invariant  ordinary  differential  equation  systems 
has  been  an  area  of  considerable  research  because  of  the  relation  to 
making  the  sensitivities  identically  zero  at  the  terminal  time  and  for 
producing  minimal  order  differential  equation  models  of  the  sensitivities 
(see  Section  I.l.a  for  a discussion  of  the  relevant  background  literature). 
We  treat  sensitivity  controllability  for  such  systems  in  Section  IV. 3. 
Therefore,  like  the  previous  section,  our  main  purpose  here  is  to  estab- 
lish the  definitions  which  will  later  be  used  to  help  obtain  practical 
computational  results  in  Section  IV. 3. 

To  conform  with  our  operator  treatment  of  the  output  parameter 
sensitivities,  we  define  sensitivity  controllability  in  terms  of  whether 
or  not  the  zero-state  operators  map  the  control  function  space  U onto  the 
output  space  R™.  (See,  e.g.,  Weiss  (Ref  32)  or  Delfour  and  Mltter 
(Ref  10).)  Recall  that 

v^^^(t;  u)  = vf^^  (t)  + W.  . (t)  u (18) 

z.l.  (1) 

where 


v^^^(t;  u)  e r”  and  W,,.(t)  e L (U  , r”*)  . 

(1;  c t 

Therefore,  if  (t)  maps  onto  r"*,  for  any  given  value  of  v^^\t;  u)  e R*” 
there  exists  a control  u c U such  that 

(v^^^t;  u)  - (t))  - W^^j(t)  u,  (19) 

and  so  the  sensitivity  v^^^  is  termed  "controllable"  at  time  t e [t^,  . 
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Sensitivity  controllability  has  previously  been  examined  for  linear 
time-invariant  ordinary  differential  equation  systems,  and  generally  It 
has  been  discussed  In  the  context  of  controllability  (or  uncontrollability) 
of  the  augmented  "sensitivity  system",  equation  (5).  Therefore,  we  define: 
Definition  3 

The  system  S Is  output  sensitivity  controllable  at  time  t e [t  , t,] 
n or 

if  the  "partitioned"  operator 


W(t)  i 


maps  onto  the  product  space  . 

Definition  4 

The  system  S Is  output  sensitivity  uncontrollable  if  there  exists 
n 


no  t e [t^,  tj]  such  that  W(t)  maps  onto  ^ 

Since  the  output  space  is  finite  dimensional,  output  sensitivity 

controllability  may  be  determined  from  the  m(p  + 1)  x m(p  + 1)  symmetric 

"sensitivity  controllability  matrix",  W(t)W*(t),  where 

W*(t)  e L (r”^P  L,(t  ,t;U))  is  the  adjoint  of  W(t).  This  well- 

c i o 

known  result  is  reiterated  in  the  following  lemma  (Ref  10): 

Lemma  2 

i)  is  output  sensitivity  controllable  at  t e [t^,  t^]  if  and 
only  if  W(t)W*(t)  is  invertible. 

11)  The  sensitivity  controllable  subspace  of  R™^'^  ^ is 
R(W(t))  - R(W(t)W*(t))  (21) 
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For  simplicity,  we  assume  that  we  wish  to  minimize  the  quadratic 
functional 


= fY(u)/SY(u)]  + [u/u] 


'-f  '■f 

/ [Y(t;u)/S(t)Y(t;u)]dt  + / [u(t)/u(t) )dt 


(22) 


where  Y(u)  e ^2^^o’  ^ augmented  output  vectof  function 


Y(u) 


y(u) 

v^^^(u) 


(u) 


(23) 


and  the  matrix  S(t)  Is  assumed  to  be  non-negative  definite  and  symmetric 

for  each  t e [t  , t,]. 

o f 

We  may  write  Y(i'''  e L-(t  , ^ ^^)  as 

/ O £ 


Y(u)  = Y . + Y (u)  - Y . + Wu 

z.l.  z.s.  z.l. 


(24) 


where  W Is  defined  by  equation  (20).  Denoting  the  adjoint  operator  of  W 
as  W*  then  the  cost  functional  J_(u)  takes  the  form 

J (u)  - [u/(W*SW+  r)u]  + 2 [u/W*Y  , ] + [Y  . /Y  . ] (25) 

Ojj  z.l.  z.l.  z.l. 

since  S Is  assumed  positive,  the  self-adjoint  operator 


(W*SW  + I)  e L (ti,U) 
c 


Is  Invertible.  Therefore,  the  unique  minimizing  control  Is  given  by 


(26) 


u*  - -(W*SW  + I)"^  V?*Y 


z.l. 
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(Ref  23).  However,  it  would  generally  be  difficult  to  obtain  the  in- 
verse of  the  operator  (W*SW  + I)  directly^,  and  so  for  the  "open-loop" 
problem  some  Iterative  solution  technique  may  be  applied.  The  gradient 
of  J_(u)  is 


VJ„(u)  = 2(W*SW  + I)u  + 2W*Y  . 

Z • 3.  • 


(28) 


and  so  either  a gradient  or  conjugate  gradient  algorithm  might  be 
utilized  to  Iteratively  compute  the  optimal  u*eU.  (e.g.,  Luenbetger 
(Ref  20) . ) 

In  Section  VI,  we  apply  this  operator  method  of  solution  to  linear 
ordinary  differential  equation  systems.  Computational  considerations 
as  well  as  some  variations  of  this  basic  problem  will  there  be  discussed. 

6.  SENSITIVITY  OPERATORS  IN  PARAMETER  IDENTIFICATION 

The  remaining  system  properties  which  we  investigate  are  system 
parameter  identlf lability  and  "input  design"  for  Improved  parameter 
identification.  To  motivate  these  topics  this  section  briefly  discusses 
the  fundamental  role  which  the  sensitivity  operators  play  in  parameter 
identification. 

a.  Output  Error  Functional 

Consider  the  system  S„  and  assume  that  b c r'^  is  an  unknown  constant 

n 

parameter  vector  which  we  wish  to  estimate  based  upon  a measured  system 


^f  the  Inverse  of  (W*SW  + I)  were  to  be  obtained  (for  example,  by  Riccatl 
equation  techniques  - see  Section  VI)  then  expression  (27)  for  u*  has  the 
form,  at  least,  of  a "closed-loop"  control  law.  However,  since  the  aug- 
mented vector  Y is  a functlv»n  of  the  a priori  zero-input  sensitivities 

(which  are  computed  under  the  assumption  that  the  control  is  "open-loop"), 
this  minimum  sensitivity  optimal  control  cannot  be  made  "closed-loop".  This 
is  a well-known  dilemma  (e.g..  Price  and  Deyst  (Ref  75)),  and  is  discussed 
further  in  Section  VI. 
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output  function  which  we  denote  by  y c V.  The  control  Input,  u.  Is 
assumed  to  be  fixed  and  completely  known.  Therefore,  to  emphasize  that 
the  control  Is  fixed.  In  this  section  and  In  Section  II. 7,  we  will  delete 
the  explicit  dependency  of  the  system  output  upon  the  control,  u.  We  let 
b^  denote  an  a priori  estimate  of  b e R , and  y(l>^)  e " denotes  the 
corresponding  mathematical  model  output  function.  The  system  sensitiv- 
ities, 1 ••  1,  2,  ...p,  are  then  assumed  to  be  evaluated  along  the 

mathematical  model  output,  and  they  are  assumed  to  exist  and  be  continuous 


f 


s 

i 

r 

r 


for  all  b e R^. 

The  estimation  criterion  which  we  adopt  Is  the  commonly  used  (e.g., 
(Ref  80))  output  error  functional 

Jg(b)  “ [y  - y(b)/y  - y(b)]  (29) 

The  best  estimate,  b*.  Is  then  the  b which  minimizes  Jg(fi).  Note  that 
other  more  generalized,  cost  functionals  might  be  selected  In  order  to 
account  for  measurement  noise  or  system  process  noise.  This  generali- 
zation Is  considered  In  Section  II. 6. d,  and  It  does  not  change  the  basic 
approach. 


i 


b.  Linearized  Measurement  Equation  and  Quaslllnearlzatlon 
Since  the  existence  and  continuity  of  the  sensitivity  operators  at 

A 

b^  are  assumed,  we  may  linearize  the  true  system  output,  y,  .About  the 

A 

nominal  output,  y(b^)»  through  the  generalized  Taylor’s  formula  as 
follows : 


y ■ y(^  ) + VAb  + e, 
o X 

- y6p)  + r b^  + 

1-1 

where  Ab  - b - S and 


(30) 
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li®  I Ujl  1 y/\ lAbj  1 - 0 (31) 

llAbll  H.  0 

Defining  the  output  error 

2 ■ y - y(b^)  ^32) 

we  obtain  the  linearized  measurement  equation 

z - VAb  + (33) 

If  there  Is  additive  measurement  noise,  n e then  the  linearized 
measurement  equation  becomes 

z - VAb  + + n (34) 

This  equation,  although  quite  simple.  Is  of  fundamental  Importance  to  our 
later  development  of  Identlflablllty  criteria  and  Input  design;  it  Is 
this  equation  which  clearly  demonstrates  the  Importance  of  the  sensitivity 
operators  In  the  parameter  Identification  problem.  To  first  order,  the 
sensitivity  operators  tell  one  how  much  the  system  output  will  be  perturbed 
by  small  changes  In  the  system  parameters.  If  the  sensitivity  Is  high, 
then.  In  some  sense,  one  can  more  accurately  estimate  the  true  parameter 
values. 

Now  assume  the  error  term  (which  is  of  second  order  In  Ab)  Is 
either  small  or  can  be  compensated  for  through  Iterations.  Then  the  out- 
put error  functional  becomes 

Jljj(Ab)  - fz  - VAb/z  - VAb]  (35) 

Since  V and  are  Hilbert  spaces  and  V e L (R^,  V),  the  theory  of  least 

c 

squares  (see  Luenberger  (Ref  20:  160))  may  be  used  to  give  the  unique 

minimizing  solution 


Ab*  - (V*V)"^  V*  z (36) 

provided  that  (V*V)  ^ exists.  Note  that,  V*V  Is  the  p x p Gram  matrix 
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with  row  and  column  element 


V*V(i,j)  •= 


while  V*z  Is  the  p x 1 vector  with  1^^  element 


V*z  (1)  = [v’-^Vz] 


Thus  It  Is  relatively  easy  to  compute  the  minimizing  solution  Ab*.  Also 
since  V is  a linear  continuous  mapping  from  the  finite  dimensional  space, 
R**,  Its  image  space  must  be  finite  dimensional  in  V , and  hence  must  be 
closed.  Then  a necessary  and  sufficient  condition  for  (V*V)  ^ to  exist 
is  that  V be  a one-to-one  mapping  (Ref  10) . These  statements  form  the 
basis  of  our  identlf lability  discussion  of  the  next  section. 

The  updated  parameter  estimate  is  then  determined  from  the  equation 


b,  = b + Ab* 
1 o 


- b + (V*V)"^  V*  z 
o 


The  system  output  may  be  linearized  about  b^^  and  y(bj^),  and  the  process 
of  obtaining  a new  Ab*  repeated.  Such  a procedure  is  a generalization  of 
the  method  of  quasllinearlzatlon  and  has  found  wide-spread  application  in 
system  parameter  identification  (see  Section  I.l.c). 

The  linearized  measurement  equation  (34)  and  the  corresponding  least 
squares  solution,  equation  (36)  provide  the  greatest  motivation  for  our 
subsequent  discussion  of  identif lability  and  input  design;  however,  it 
is  instructive  to  note  the  role  which  the  sensitivity  operators  play  in 
other  Iterative  metliods  for  minimizing  the  output  error  functional, 

J^(b  ),  equation  (29).  In  the  next  section  we  briefly  discuss  first  and 
second  order  gradient  procedures,  and  compare  the.se  algorithms  with  the 
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quaslllnearlzatlon  algorithm.  In  all  three  cases,  the  sensitivity 


operators  have  fundamental  Importance  In  the  Iterations. 


c.  Gradient  Minimization  Procedures 

Once  again  consider  the  general  output  error  functional,  J^(b), 
equation  (2$),  and  assume  that  both  the  first  order  sensitivities 
v^^\  1 1,  2,  ...p,  and  the  second  order  sensitivities 


i.  J = 1.  2, 


b 

o 


• P 


(40) 


^ P 

exist  and  are  continuous  for  all  b^  e R*^.  Then  it  Is  straight-forward 
to  show  that  the  first  and  second  order  gradients  of  are 


PJ  d)  = -2V*z 
e 

(41) 

(b)  » 2(V*V  - .V*  z) 
e D 

(42) 

where  V*z  and  V*V  are  defined  by  equations  (38) 
and  the  symmetric  p x p matrix  vf  z has  l*"^  row 

D 

and  (37),  respectively, 

and  column  element 

V*  z(l,J)  - [v^^*^Vz] 

(43) 

Sufficient  conditions  for  b*  c R^  to  be  a local  minimizing  solution 

A 

of  J^(b)  are  well-known  and  are  stated  in  the  following  lemma; 

Lemma  3 

For  b E R suppose  that  ) “ 0 and  that  the  symmetric  p x p 

2 ^ i A 

matrix  V J (b  ) Is  positive  definite.  Then  b is  a local  minimum  of 
e 

J (b). 

e 

Comments 

1.  Since  y^)  nwy  be  a general  nonlinear  function  of  S,  no  more 

than  a local  minimizing  solution  may  be  guaranteed;  that  Is,  the  output 
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error  functional  J (b)  may  have  more  than  one  local  minimum, 
e 

A 

2.  If  2 ■ y - y(b)  Is  allowed  to  take  on  any  value,  then  It  is 

necessary  that  the  matrix  V*V  be  positive  definite  to  ensure  that 
2 

7 Jg(b)  is  positive  definite.  In  Section  II. 7 we  will  see  that  this 
fact  Is  Intimately  related  to  the  conditions  for  local  loentlf lability. 

A 

A first  order  gradient  algorithm  to  find  the  minimizing  b may  now 
be  defined  by  the  sequence  of  steps  (see,  for  example,  Luenberger 
(Ref  20:  283  - 287)) 


^ ^ n A 

b . , - b - ^ 7J  (b  ) 
n+1  n 2 e n 


b + s V z 
n n 


(44) 


The  gradient,  2V  z.  Is  evaluated  at  b , and  the  scalar  step  size  s Is 

n n 

^ A 

chosen  to  minimize  J (b  + s V z).  Through  the  first  order  linearl- 
e n n ® 

zatlon 


y(b^  + 8 V*  z)  ' ^ ® ^ 


(45) 


we  see  that 


Je(bn  + s V*  z)  = [z/z]  - 2s[2/W*  z]  + 8^[W*  z/W*  z] 


Thus,  the  approximate  minimizing  s^  Is  given  by 


(46) 


.*  - 


IV*  z/V*  z] 

[V*  z/(V*V)V*  z] 


(47) 


provided  that  the  denominator  Is  non-zero.  For  arbitrary  V*z  j*  0 

A 

(recall  that  z“y-y(b  ))  a sufficient  condition  for  the  denominator 

n 

to  be  non-zero  Is  that  the  symmetric  p x p matrix,  V*V,  (evaluated  at  b ) 

n 

be  positive  definite.  The  gradient  steps  are  then  repeated  until  the 
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gradient  vector  V*z,  has  a norm  less  than  some  preselected  small  positive 
value.  Using  the  step  size  s*,  the  method  Is  known  as  steepest  descent 
(Luenberger  (Ref  20:  286)). 

Similarly,  the  Newton-Raphson  algorithm  may  be  used  to  Iteratively 
compute  the  minimizing  b*  e R^,  (a.ee,  for  example,  Luenberger  (Ref  20: 
284)).  Such  a procedure  Is  defined  by  the  sequence  of  steps 


b - b - (V^J  (b J)*^ 
n+1  n e n e n 


- £ + (V*V  - V*  z)"^  V*  z 
n D 


(48) 


provided  that  the  Indicated  Inverse  exists.  Again,  for  arbitrary  V*  z, 
a necessary  condition  for  the  second  order  gradient  matrix,  V^J(6^),  to 
be  positive  definite  Is  that  the  symmetric  p x p matrix  V*V  (evaluated  at 

A 

b ) be  positive  definite.  Once  again  the  sensitivity  operators  play  a 
n 

fundamental  role  In  the  Iterations. 

Now  compare  the  first  order  gradient,  the  quaslllnearlzatlon,  and 
the  Newton-Raphson  algorithms.  All  three  have  the  same  form: 


b + K V*  z 
n n 


(49) 


For  the  first  order  gradient  algorithm,  the  weighting  matrix  K Is  the 

n 

scalar  step  size  s which  may  be  selected  according  to  equation  (47). 
n 

For  quaslllnearlzatlon,  K Is  the  symmetric  p x p matrix,  (V^V)  and 

n 

for  the  Newton-Raphson  algorithm  It  Is  the  second  order,  symmetric, 
p X p,  gradient  matrix  (V*V  - V*  z)  As  stated  previously,  we  are 
primarily  Interested  In  the  linearized  measurement  equation  (34)  and 
the  associated  quaslllnearlzatlon  algorithm,  but  It  Is  Interesting  to 
note  the  similarities  between  the  quaslllnearlzatlon  method  and  these 
other  procedures.  In  all  three  cases,  the  sensitivity  operators  play 
a key  role  In  the  Iterations. 
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It  is  also  Instructive  to  note  the  appearance  In  all  three  algo- 
rithms of  the  symmetric,  p x p,  positive  matrix,  V*V.  Suppose  that  we 
use  the  approximate  optimal  s*  as  given  by  equation  (47)  In  the  first 
order  gradient  algorithm.  Then  In  each  of  the  three  algorithms  we  re- 
quire V*V  to  be  positive  definite  In  order  that  we  may  ensure  a priori 
a local  solution  to  each  Iterative  step  (considering  z e V to  take  on 
any  value).  In  Section  II. 7 the  matrix  V*V  Is  shown  to  be  of  basic 
Importance  to  the  question  of  local  Identlf lability  of  the  syst^em 
parameters.  Hence,  we  term  this  p x p symmetric  matrix  the  "local 
Identlf lability  matrix"  and  give  It  the  symbol 

M(b^)  - V*(b^)V(b^)  (50) 

/s 

The  dependency  upon  b^  Is  shoim  explicitly  here  for  emphasis. 


d.  Generalized  Norm  and  Measurement  Noise 

In  this  section  the  results  of  Sections  II. 6. b and  II. 6. c are  con- 
sidered for  the  minimization  of  the  more  general  output  error  functional 

jQ(b)  “ [(y  - y(b))/Q(y  - y(b))] 

”■  / [y(t)  - y(t;b)/Q(t)(y(t)  - y(t;b))]  dt  (51) 

For  each  t e [t^,  t^]  the  m x m matrix  Q(t)  e R™  * ™ Is  assumed  to  be 
symmetric  and  uniformly  bounded  and  positive  definite*.  Such  an  output 
error  functional  would  generally  find  application  In  maximum  likelihood 
Identification  when  there  Is  zero-mean  additive  white  Gaussian  measure-  1 

ment  noise,  n e 1^,  with  covariance 


*That  Is,  there  exists  positive  constants,  c^^  and  C2»  such  that 

£ [•/Q(t)']  ^ C2[*/*l  for  all  t c (t^,  t^l. 
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E(n(t)n^(T))  - Q'^(t)6(t  - t) 


(see,  e.g.,  (Ref  85)).  In  the  notation  above,  £(•)  is  the  expectation 
operator,  the  superscript  "T"  Is  the  transpose,  and  6(*)  Is  the  unit 
sampling  function  defined  by  fi(t)  ••  0 for  t ^ 0,  and 


I 6(t)g(t)dt  - g(0) 


for  any  arbitrary  function  g. 

Using  the  assumptions  concerning  Q and  Introducing  the  Q-lnner 


product  defined  by^: 


ty/yjq  “ [y/Qy] 


for  ye/”  straight-forward  to  show  that  the  Hilbert 

space  {V', (•/•]}  Is  Isometrlcally  Isomorphic  to  the  Q-Hllbert  space 

(Ref  27:  84  - 85),  and  so  the  two  may  be  treated  Interchange- 

ably. In  terms  of  the  Q-Hllbert  space,  the  output  error  functional 
Jp(b)  takes  the  form 

jQ(b)  “ [y  - y.(b)/y  - y(h)]Q  (55) 

A 

Thus,  cur  previous  results  concerning  Jg(b)  can  be  readily  transferred 
to  this  more  general  case. 

To  Illustrate,  consider  the  adjoint  of  V mapping  from  the  Hilbert 
space  {R^, [•/•]}  Into  the  Hilbert  space  {V,[./*]q}.  Suppose  that  V*  Is 
defined  by 


ly/vb]  - [V*y/b) 


for  y e /,  b c R^.  Then 


*Inner  products  not  subscripted  are  assumed  to  be  the  natural  Inner  product. 
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ly/Vb]^  - [y/QVb]  - [V*Qy/bl . (57) 

Therefore,  V*Q  is  the  adjoint  of  V mapping  between  (R^, !•/•])  and 

Thus,  while  the  matrix  V*V  was  of  fundamental  Importance  In 

A 

minimizing  J^Cb),  we  see  that  the  counterpart  matrix 

V*QV  - lv^^^(t)/Q(t)v^^^(t)]dt  (58) 

t 

o 

A 

Is  basic  to  minimizing  JQ(b).  If  Q has  the  physical  meaning  described  In 
the  first  paragraph  of  this  section,  then  this  matrix  Is  termed  the 
"Information  matrix"  (Ref  31) . It  has  general  Importance  In  estimation 
theory  because  the  covariance  of  estimation  error  for  any  unbiased 
estimator  Is  limited  by  the  Cramer-Rao  lower  bound  (Ref  31) : 

E((b  - b*)(b  - b*)"^)  >,  (V*(b)QV(b))“^  (59) 

where  b c R^  is  the  true  parameter  value. 

With  this  short  Introduction  into  the  relationship  between  parameter 
sensitivity  and  system  Identification,  we  now  discuss  the  related  topics 
of  Identif lability  and  sensitivity  design  for  optimal  Identification. 

7.  SYSTEM  PARAMETER  IDENTIFIABILITY 

In  this  section  we  discuss  parameter  Identif lability.  Background 

literature  on  this  topic  was  presented  In  Section  I.l.d.  Lljte  the  ti-c- 

tlons  on  Insensitivity  and  controllability  our  primary  purpose  here  Is 

to  establish  the  definitions  and  an  operator  methodology  for  treating 

Identlflabillty.  In  Section  IV  this  approach  will  be  applied  to  linear 

time- Invariant  ordinary  differential  equation  systems  to  yield  algebraic 

conditions  for  Identlflabillty. 

Roughly  stated,  parameter  Identlflabillty  Is  the  question  of 

determining  a priori  system  conditions  which  ensure  that  a unique  best 
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esciroate  of  the  unknown  parameters  may  be  obtained  from  the  observable 
output.  Such  a concept  of  Ident If lability  was  Introduced  by  Lee  (Ref  19). 
As  noted  In  Section  I.l.d  various  other  probabilistic  Interpretations  of 
Identlflablllty  have  been  utilized;  however,  uniqueness  of  solution  Is 
most  consistent  with  our  deterministic  approach  to  parameter  Identifica- 
tion, and  It  Is  the  concept  of  identlflablllty  which  has  been  used  by 
quite  a number  of  other  researchers  In  the  determination  of  structural 
properties  of  identlflablllty.  (See  Section  I.l.d  for  discussion.) 

To  motivate  the  definition  of  local  Identlflablllty  which  we  adopt, 
again  consider  the  linearized  measurement  equation 

2 = y _ y(b^)  “ VAb  + + n (60) 

The  bias  is  of  second  order  in  Ab  and  the  measurement  noise  process 
n e y Is  immaterial  for  our  deterministic,  "least  squares"  approach. 

(See  Section  II. 6. d.)  Then  for  values  of  b which  are  local  to  b 

o 

equation  (34)  becomes  approximately 

z - VAb.  (61) 

For  a given  measurement  error  process  z e V,  it  Is  well-known  that  the 
above  equation  has  a unique  solution  for  Ab  If  and  only  if  the  operator 
V Is  one-to-one.  Therefore,  we  define: 

Definition  5 

The  system  S Is  locally  Identifiable  at  b e R^  if  V e L (R^,^)  Is 
H O C 

one-to-one. 

Definition  6 

The  locally  non- Identifiable  subspac.e  at  b^  e R^  Is  the  null  space 
of  V while  the  locally  identifiable  subspace  is  the  orthogonal  comple- 
ment of  the  non-ldentlf lable  subspace  M(b^)  - W(V)1’ 
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The  non-ldenClflable  subspace  is  closely  related  to  the  concepts  of 
Insensitivity.  In  vords,  the  non-ldentlflable  subspace  Is  the  subspace 
In  which  parameter  perturbations  will  cause  no  first  order  change  In  the 
observable  output.  This  relationship  between  system  parameter  insensi- 
tivity and  system  non-identif iability  has  been  recognized  previously  and 
is  discussed  further  in  Section  IV.  (See,  for  example,  Bonivento  (Ref  55).) 

P 

V is  bounded  linear  operator  from  the  finite  dimensional  space  R 
to  the  function  space  Y and  so  must  have  a finite  dimensional  nange  space. 
Thus  R(V)  must  be  closed  in  Y,  and  so  we  may  immediately  obtain  the 
following  lemma: 

Lemma  4 

The  system  S^  is  locally  identifiable  at  b^  e R^  if  and  only  if  the 
p X p dimensioned  symmetric  "Identlflablllty  matrix" 


M(b  ) - V*V 
o 


has  an  Inverse.  Further,  the  non-identif lable  subspace  is 


(62) 


N(M(b^))CR‘’ 


while  the  identifiable  subspace  is 


(63) 


M(b  ) i R(M(b^))CRP 
o o 


(64) 


Proof 


The  first  two  parts  of  the  theorem  follow  directly  from  Definitions 
5 and  6 and  standard  results  from  functional  analysis  (see,  e.g.,  Taylor 
(Ref  27:  250  - 251)).  The  last  part  of  the  theorem  follows  from  the 
fact  that  M(6^)  is  a symmetric  matrix  and  property  2 of  Lemma  1. 

Q.E.D. 
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From  Lemma  4 and  the  results  of  Section  II. 6. b (see  equation  (36)) 

we  see  that  system  S„  Is  locally  Identifiable  If  and  only  If  we  may 
n 

obtain  a unique  solution  of  linearized  measurement  equation  error  func- 
tional, J,,,(4b),  equation  (35).  Thus  we  see  the  relationship  between 
LH 

the  definition  for  local  identlf lability  and  the  ability  to  obtain, 
locally,  a unique  minimum  norm  estimate  of  b.  Indeed,  Lemma  II. 5 could 
be  taken  as  the  definition  of  local  Identiflabllity  and  definitions  5 
and  6 then  derived  as  lemmas  to  this  alternate  definition.  Th&  Is  the 
approach  which  several  others  have  taken  (e.g.,  Martenson  (Ref  96), 
Glover  and  Willems  (Ref  93)),  but  either  approach  is  equivalent.  Also, 
In  the  comments  of  Section  II. 6. c we  noted  that  the  local  information 
matrix  must  be  positive  definite  to  ensure  the  existence  of  local 
steps  In  the  steepest  descent  and  Newton-Raphson  Iterations,  as  well. 

Since  the  self-adjoint  operator  Q of  our  generalized  norm  func- 
tional, equation  (51) ,•  Is  assumed  Invertible,  It  Is  easy  to  show  that 


a.  R(V  QV)  - R(V*V) 


b.  N(V*QV)  - N(V*V) 


c.  (V*QV)  ^ exists  If  and  only  If  (V*V)  ^ exists. 


Thus  the  local  information  matrix,  MQ(b^)  = V*QV,  may  be  substituted 
for  the  Identiflabllity  matrix,  M(b^),  in  Lemma  4,  and  the  conclusions 
remain  the  same.  With  this  substitution.  Lemma  4 Is  a restatement  of 
the  well-known  result  that  the  system  Is  locally  Identifiable  If  and 
only  If  the  local  Information  matrix,  MQ(b^),  Is  positive  definite. 

(See,  for  example,  Tse  (Ref  102).) 

In  Section  IV  we  apply  these  results  to  obtain  local  identiflabllity 
conditions  for  linear  ordinary  differential  equation  systems.  For  time 
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varying  systems.  Lemma  4 and  the  "Identlflablllty  matrix"  may  be  used 
to  test  for  local  Identlflablllty;  however,  for  time-invariant  systems 
an  algebraic  description  of  the  sensitivity  operators  allows  us  to  use 
Definitions  5 and  6 directly  and  thereby  obtain  algebraic  criteria  for 
local  Identlflablllty.  Using  these  algebraic  conditions  we  will  see 
that  we  may  not  only  test  for  local  Identlflablllty  at  the  nominal 
b^  e R^,  but  we  may  treat  b e as  a variable  and  test  for  Identlfla- 
blllty as  a function  of  b.  Such  a concept  was  Introduced  by  Gupta  and 
Hall  (Ref  108)  In  consideration  of  "structural"  controllability  of  the 
sensitivity  operators  of  linear  time-invariant  ordinary  differential 
eqtiatlon  systems.  Therefore,  we  define: 

Definition  7 

The  system  Is  structurally  Identifiable  If  V(b)  e L^(R^,y)  Is 
one-to-one  for  almost  all  b e R^. 

Such  a structural  concept  Is  particularly  significant  for 
Identlflablllty  because  one  wishes  to  ensure  that  the  system  will  be 
"locally"  Identifiable  along  the  entire  sequence  of  possible  parameter 
estimates. 

Finally  not  only  may  It  be  Important  to  know  whether  the  param- 
eters of  a linear  system  are  locally  Identifiable  from  the  total  system 
output,  but  It  may  be  equally  Important  to  know  whether  the .q>arameters 
are  locally  Identifiable  from  the  zero-input  response  alone  or  from  the 
zero-state  response  alone  or  from  both.  For  example,  such  Information 
might  be  Important  In  the  design  of  experiments  for  parameter  Identifica- 
tion (see  Section  II. 8).  This  Information  Is  easily  obtained  by  con- 
sidering whether  or  not  the  zero-input  and  zero-state  sensitivity 
operators  are  one-to-one,  respectively.  Therefore,  definitions  parallel 


to  Definitions  5 and  6 can  be  made  for  zero-input  and  zero-state  local 
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Identif lability,  and  Lemma  4 may  be  given  appropriate  corollaries  for 
the  "zero-input"  and  "zero-state"  local  Identlf lability  matrices.  Also, 
just  as  the  case  for  parameter  insensitivity,  we  will  find  that  It  Is 
more  convenient  to  obtain  separate  conditions  for  zero-input  and  zero- 
state  local  identlfiablllty  rather  than  obtaining  joint  conditions  (see 
Section  IV).  The  following  lemma  relates  the  separate  zero-input  and 
zero-state  locally  identifiable  subspaces  to  the  total  locally  identi- 
fiable subspace: 

Lemma  5 

Let  V . and  V be  the  zero-input  and  zero-state  sensitivity 

Zeie  Z«S« 

operators  for  the  system  S„  evaluated  at  the  nominal  b e R^.  Suppose 

n O 


that  V . b ^ -V  b for  all 
z.l.  z.s. 


b e (N(V  . ))*CR**,  b j*  0. 
z.l. 


Then  the  total  system  non-ldentlflable  subspace  is 


^^(V)  - W(V^  . )nN(v^  ^ ) 

Zs^a  Ze8« 


and  the  total  identifiable  subspace  is 


(67) 


(68) 


K ^ « (b) 

O ZeXe  O Ze8«  O 


(69) 


Proof 


Since  V-V  . +V  ^ it  is  clear  that 

ZaXe  ZeSa 


w<v,  . )nv(v,  ^ )CW(v). 

z.l.  z.s. 


(70) 


Suppose  b e N(V)  and  b ^ 0.  Then 


V . b + V b - 0, 
z.l.  z.s. 


(71) 
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this  would  contradict  the  assumption  (67).  Hence  b e M(V  . ).  This 

Z • X * 

along  with  equation  (71)  implies  that  b e M(V  ) . Thus 

z.  s. 

W(V)CW(V,  . )nN(V^  ^ ) (72) 

and  the  assertion  (68)  is  proved. 

Finally,  since  M(V)CR^,  equation  (69)  is  obtained  by  taking  the 
orthogonal  complement  of  both  sides  of  equation  (68).  (See,  for  example, 

t 

Nerlng,  Theorem  4.4  (Ref  21:  140).) 

Q.E.D. 

In  this  section  we  have  considered  the  binary  question  of  whether  or 
not  the  parameters  of  a linear  system  are  locally  identifiable  at  an 
a priori  parameter  value  b^  e R^.  We  have  not  discussed  the  "quality" 
of  the  identification  (estimation)  capability.  This  important  experi- 
mental design  question  is  discussed  in  the  next  section. 

8.  SENSITIVITY  OPERATOR  DESIGN  FOR  OPTIMAL  PARAMETER  IDENTIFICATION 

In  Section  II. 6 we  discussed  the  fact  that  the  "quality"  of  param- 
eter identification  capability  is  strongly  dependent  upon  the  parameter 
sensitivity  operators.  Heuristlcally,  if  we  consider  the  linearized 
measurement  equation  (34)  of  Section  II. 6. a with  process  noise  n e.  V 
and  a priori  parameter  value  b^  e R^, 

z ■ V4b  + Ej^  + n (34) 

then  the  sensitivity  operator  V may  be  likened  to  the  signal  strength 

in  a communication  channel.  For  improved  identification,  one  would 

like  to  maximize  the  "slgnal-to-nolse"  ratio.  If  Q ^ is  the  covariance 

matrix  of  the  zero-mean  Gaussian  white  noise  process  (see  Section  II. 6. d) 
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then  the  local  information  matrix,  • V*QV  may  be  viewed  as  the 
"signal-to-noise"  ratio  for  the  multi-channel,  multi-parameter,  estima- 
tion problem,  equation  (34).  Also,  we  commented  in  Section  II. 6. d the 
information  matrix  evaluated  at  the  true  system  parameter  value,  b, 
provides  a lower  bound  for  the  estimation  error  covariance  matrix  of 
any  unbiased  estimate  through  the  Cramer-Rao  inequality  (59).  Since 
we  do  not  know  the  true  parameter  value,  b,  we  can  only  approximate 
this  lower  bound  through  the  use  of  the  local  information  matrix,  M^. 

Therefore,  based  upon  the  preceding  discussion,  we  will  use  some 
measure  on  the  local  information  matrix  as  a criterion  of  optimality  to 
"design"  system  nominal  sensitivity  operators  for  improved  parameter 
identification.  Indeed,  for  just  these  same  reasons  the  local  informa- 
tion matrix  has  been  used  as  a design  criterion  for  improved  parameter 
identification  by  quite  a number  of  previous  researchers  (see  Section 
I.l.e  for  a discussion  of  the  relevant  literature  on  input  design). 
Since  an  optimization  criterion  generally  requires  a scalar  quantity, 
some  suitable  scalar  measure  on  the  p x p information  matrix  muSt  be 
obtained.  This  question  is  discussed  briefly  in  the  next  subsection. 

If  we  assume  that  both  the  initial  condition  vector,  d e D,  and 
the  control  input  u e U,  are  Independent  of  the  unknown  parameter 
vector,  b e R^,  and  that  they  may  be  selected  arbitrarily,  then  the 
nominal  sensitivity  operators  are  functions  of  both  of  these  quanti- 
ties. To  be  more  explicit,  the  zero-input  sensitivities  are  linear 
operators  on  d e D 

'“L  W*  ■ <’■>  (!)■* 

while  the  zero-state  sensitivities  are  linear  operators  on  the  control 
u c 1/ 
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(u)  - (W).,.u  (74) 

Z.S. 

i ■ 1,  2,  ...p.  Therefore,  both  d and  u may  be  selected  to  optimize  some 
measure  of  the  local  information  matrix.  This  optimization  could  be 
accomplished  concurrently;  however,  it  Is  computationally  more  convenient 
to  consider  the  separate  selection  of  the  Initial  condition  vector  which 
optimizes  the  "zero-input"  local  Information  matrix  on  the  one  hand  and 
the  control  function  which  optimizes  the  "zero-state"  local  Information 
matrix  on  the  other.  These  computational  advantages  will  be  understood 
more  clearly  when  this  topic  is  discussed  for  linear  time-invariant  ordinary 
differential  equation  systems  In  Section  VII.  Also,  this  separate  design 
of  the  Initial  condition  vector  and  the  control  Input  based  upon  the  zero- 
input  and  zero-state  responses,  respectively.  Is  related  to  the  previous 
section  concerning  the  separate  determination  of  the  zero-input  and  zero- 
state  Identifiable  subspaces.  Indeed,  certain  parameter  components  may 
be  Identifiable  from  one  response  while  not  from  the  other.  Thus,  from 
a practical  polnt-of-vlew.  It  may  be  desirable  to  design  separate  experi- 
ments with  zero  Initial  conditions  In  one  case  and  zero  control  Inputs  in 
the  other. 

Finally,  we  comment  that  the  approach  taken  In  this  section  Is 
motivated  by  the  work  of  Mehra  (Ref  112) . Mehra  considers  the  problem 
of  control  Input  design  for  optimal  parameter  Identification  In  linear 
time-invariant  ordinary  differential  equation  systems.  For  the  special 
case  of  zero  Initial  conditions,  he  obtains  an  operator, formulation  of 
the  optimization  problem  and  shows  that  the  optimal  control  Is  the  eigen- 
function corresponding  to  the  maximal  eigenvalue  of  a positive,  self- 
adjolnt  operator.  The  approach  taken  here  Is  a generalization  of  Mehra 's 
work,  and  conceptually  there  Is  very  little  difference.  However,  it 


should  be  mentioned  that  the  concept  of  formally  optimizing  the  initial 
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condition  vector  to  provide  optimal  parameter  Identification  capability 
In  the  transient  response  Is  a new  concept  to  the  current  theoretical 
literature  on  parameter  Identification.  On  the  other  hand,  the  manipu- 
lation of  the  Initial  condition  vector  to  provide  Improved  Identifica- 
tion Is  a well-known,  and  often  very  important,  ad-hoc  procedure  for 
providing  parameter  Identification  capability  in  practical  applications. 
(See,  for  example,  (Ref  85)  in  which  Initial  platform  orientation  plays 
an  essential  role  in  the  identification  of  Inertial  platform  p^ameters.) 
The  capability  to  determine  optimal  Initial  conditions  for  Identification 
purposes  might  be  particularly  Important  In  cases  In  which  It  Is  either 
Impossible  or  very  costly  to  provide  a control  Input  in  order  to  Identify 
system  parameters. 


a.  Optimization  Criterion 

Because  the  information  matrix  Is  dimension  p x p,  some  suitable 

I 

scalar  measure  must  be  chosen  as  an  optimization  criterion.  This 
problem  has  been  discussed  by  a number  of  researchers,  and  Section  I.l.e 
reviews  the  literature  on  this  subject. 


From  an  Information  theory  viewpoint,  either  a linear  functional  of 
the  dispersion  matrix  (Inverse  of  the  local  Information  matrix)  or  the 
determinant  of  the  dispersion  matrix  are  the  most  appropriate  criterion. 
From  the  Cramer-Rao  lower  bound  we  know  that  the  dispersion  matrix  is  a 
lower  bound  for  the  covariance  matrix  of  the  parameter  estimation  error, 
and  so  a linear  functional  of  the  dispersion  matrix  (usnally  a weighted 
trace)  is  directly  related  to  the  variance  of  estimation  error.  On  the 
other  hand,  the  determinant  of  the  dispersion  matrix  Is  physically 
appealing  because  It  Is  proportional  to  the  volume  of  the  error-elllpsold 
In  p-dimenslonal  parameter  space. 
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However,  the  direct  use  of  either  of  these  criteria  leads  to  diffi- 
cult computational  problems  In  an  optimization  algorithm.  Therefore,  a 
criterion  used  most  frequently  because  of  the  convenient  quadratic  func- 
tional which  It  provides  Is  a linear  functional  on  the  information  matrix 
Itself.  For  this  reason  we  choose  a weighted  trace  of  the  Information 
matrix 

Jjj(d,u)  - Z [v^^Vqv^^]  (75) 

1,J“1 

as  our  optimization  criterion.  It  is  fairly  well-known  (e.g.  (Ref  114), 

(Ref  119),  and  (Ref  122))  that  such  a criterion  can  lead  to  identifi- 

abillty  problems  and  poor  estimates  of  the  parameters  if  the  weighting 

constants  k^^^  are  not  selected  judiciously.  However,  Nahi  and  Napjus 

(Ref  117),  Mehra  (Ref  114)  and  (Ref  115),  and  Gupta  and  Hall  (Ref  108) 

develop  iterative  algorithms  to  adjust  the  weightings  k^j , 1,  j*l,  2,  ...p 

so  that  the  resulting  maximum  of  J will  minimize  either  a linear  func- 

tlonal  on  the  dispersion  matrix  or  the  determinant  of  the  dispersion 

matrix.  The  maximization  of  J Is  a basic  and  most  time-consuming  part 

1\ 

of  each  of  these  Iterative  techniques,  and  so  It  Is  quite  apropos  to 
utilize  Jj^  as  our  optimization  criterion. 

b.  Selection  of  Initial  Conditions 

In  this  section  we  assume  that  the  system  S„  has  zero  control  Input 

n 

and  that  we  wish  to  select  an  Initial  condition  vector,  d*  c D,  which 
optimizes  parameter  Identification  capability  from  the  zero-input  response 
The  optimization  functional  takes  the  form 
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z*l« 


I k.,[vf^J  (d)/QvpJ  (d)] 
ij  2.1.  2.1. 


J.J-1 


Z k^j[T^j,jd/QT^jjd]  (76) 

l.J-1 


If  we  let  T*,..  e L (/,D)  denote  the  adjoint  of  T,..  e L (D,/)  then 
(.1;  c (1;  c 

J (d)  takes  the  form 

2.1. 

Jjj  (d)  - [d/Aj^  d]  (77) 

2.1.  2.1. 


where  Aj^  e L (D,D)  Is  the  positive,  bounded  self-adjoint  operator 

2.1.  ^ 


E k,.  T 


QT, 


z • J.  * 


l.J-1 


ij  (i)^  (j7 


(78) 


For  linear  ordinary  differential  equation  systems,  D is  the  finite  dimen- 
sional space  r”,  and  so  is  a positive,  symmetric  n x n matrix. 

2.1. 

Thus  (d)  is  a quadratic  form  in  d.  Now  in  order  to  obtain  a 

K j 

z.l. 

unique  maximum  of  this  functional,  some  auxiliary  constraint  must  be 
placed  upon  the  initial  condition  vector,  d e D.  It  is  convenient  to 
assume  that  d must  satisfy  the  "energy"  constraint^ 


[d/d]  < 1. 


(79) 


^Note  that  there  is  no  loss  in  generality  by  this  assumption;  for,  suppose 
we  wish  to  satisfy  the  more  general  constraint  [d*/Ed*]  ^ 1,  where  E is  a 
strictly  positive,  self-adjoint  operator.  Defining  the  E-lnner  product 

[•/•Ig  >•  [•/£•] 

it  is  easily  shown  that  the  Hilbert  space  {D,(./.]j,}  Is  isomctrlcally 
isomorphic  to  the  natural  Hilbert  space  {D,f./.I}.  Therefore,  the  con- 
straint which  we  have  assumed  may  easily  be  extended  to  this  more  general 
case. 
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Then  It  Is  well  known  (e.g.,  Blxim  (Ref  3:  601))  that  the  d*  c D which 

maximizes  J~  (d)  and  satisfies  the  constraint  (79),  Is  the  eigen- 

Zeie 

vector  of  which  corresponds  to  the  largest  eigenvalue  of  the 

z.l.  ^ 

self-adjoint  operator  In  such  a case,  [d  /d*]  ■■  1,  and  the 

z • 1 • 

eigenvector  need  not  be  unique.  If  the  operator  Aj^  e L^(D,D)  Is 

z.l. 

compact  (which  It  will  be  If  D « r”),  then  we  are  assured  that  a 
maximizing  eigenvector,  d*  c D,  does  exist.  (See,  for  example,  Blum 
(Ref  3;  601).) 

In  Section  VII  we  consider  this  problem  formulation  for  linear 

time-invariant  ordinary  differential  equation  systems,  and  the  positive, 

symmetric  n x n matrix,  A^^  Is  easily  computed.  Obtaining  the  optimal 

z.l. 

Initial  condition  vector  Is  then  merely  a matter  of  determining  the 
maximizing  eigenvector  of  A^^ 

z.l. 


c.  Selection  of  Control  Input 

In  this  section  we  assume  that  the  system  S has  zero  Initial  condl- 

n 

tlons  and  that  we  wish  to  select  a control  Input,  u*  e U,  which  optimizes 
parameter  Identification  capability  from  the  zero-state  response.  Thus 
the  optimization  functional  takes  the  form 


(u) 


l.j-1 


P 

- £ <80) 

i.J-1 

This  problem  Is  a ^ omplete  parallel  of  the  zero-input  design  problem  of 

the  previous  section.  Let  W*...  c L (y,U)  denote  the  adjoint  operator 

{ij  c 

of  c L^(U,y).  Then 

Jjj  (u)  - [u/Ajj  u]  (81) 


.i.il.HiilHUIIWPIWiBi'  .11  I Ii.il.il  W'W 
...  , ■:  - 
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where 


A„  e L (UfU)  is  the  positive,  bounded,  self-adjoint  operator 


l.j.l 

Again  we  assume  a control  "energy"  constraint^ 


[u/u]  £ 1. 


(82) 


(83) 


Tn^n  the  u*  c U which  maximizes  fu)  and  satisfies  the  constraint 

z.s. 

(83),  Is  the  eigenfunction  corresponding  to  the  largest  eigenvalue  of 

the  positive,  self-adjoint  operator  A^^  Again,  It  will  be  true  that 

z.s. 

[u*/u*]  > 1,  and  the  maximizing  eigenfunction  need  not  be  unique.  Also, 

If  A^  Is  compact,  we  are  assured  the  existence  of  the  maximizing 

z.s. 

eigenfunction.  (See,  Blum,  (Ref  3:  601).)  The  operator  Aj^  , equa- 

z.s. 

tlon  (82),  will  be  compact  If  each  of  the  partlals  1 ■■  1,  2,  ...p, 

are  compact  (products  ’of  bounded  linear  operators  and  compact  operators 
are  compact,  and  linear  combinations  of  compact  operators  are  compact 
(Ref  27:  274)).  For  our  later  application  to  linear  ordinary  differen- 
tial equation  systems,  the  operators  Fredholm  Integral  equations 

on  the  compact  Interval  [t^,  t^l  and  have  continuous  kernels,  and  so  they 
will  Indeed  be  compact  (see  Llustemlk  and  Sobolev  (Ref  19:  129)). 

The  above  presentation  Is  a generalization  of  the  results  of  Mehra 
(Ref  112)  to  control  Input  design  for  the  Hilbert  space  system  S^.  Mehra 
suggests  various  computational  methods  of  solution  for  (his  problem,  but 


^Note  that  there  Is  no  loss  In  generality  by  this  assumption;  for,  suppose 
we  wish  to  satisfy  the  more  general  constraint  (u*/Eu*]  £ 1,  where  E is  a 
strictly  positive , self-adjoint  operator.  Defining  the  E-lnner  product 

[•/•Ie  ” (-/E-J 

It  Is  easily  shown  that  the  Hilbert  space  Is  Isometrlcally 

Isomorphic  to  the  natural  Hilbert  space  (U, [•/•])•  Therefore,  the  con- 
straint which  we  have  assumed  may  easily  be  extended  to  this  more  general 
case. 
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we  will  not  discuss  these  until  Section  VII.  There  we  will  examine 
computational  techniques  associated  with  Input  design  for  linear  time- 
invariant  ordinary  differential  equation  systems. 

9.  SUMMARY 

This  concludes  our  treatment  of  parameter  sensitivities  for  the 
Hilbert  space  linear  system,  S . Some  known  concepts  have  been  extended 
to  this  Hilbert  space  setting  (e.g.,  parameter  insensitivity,  sensitivity 
controllability,  parameter  identification  techniques,  local  parameter 
identiflabillty,  and  input  design  for  improved  identification),  and  some 
new  techniques  have  been  suggested  (e.g.,  an  operator  approacn  to  minimum 
sensitivity  control  design  and  selection  of  an  initial  condition  vector 
to  optimize  Identification  capability  from  the  zero-input  response). 

But  the  primary  contribution  of  this  section  has  been  to  present  a uni- 
fied operator  approach  for  treating  parameter  sensitivity  and  various 
sensitivity-related  system  properties. 

This  section  lays  the  groundwork  for  our  application  to  linear 
ordinary  differential  equation  systems  considered  in  the  subsequent 
sections.  We  concentrate  on  time-invariant  systems  because  It  Is  here 
that  a new  algebraic  description  of  the  parameter  sensitivities  pro- 
vides convenient  geometric  and  computational  tools  for  treating  the 
system  properties.  This  algebraic  description  of  the  parameter 
sensitivity  operators  is  developed  in  the  next  section. 
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Section  III 

SENSITIVITY  OPERATORS  FOR  LINF-AR  ORDINARY  DIFFERENTIAL  EQUATION  SYSTEMS 

In  this  section  operator  expressions  are  derived  for  parameter 
sensitivity  of  continuous  linear  ordinary  differential  equation  systems. 
Both  time-varying  and  time- Invariant  systems  are  considered.  These 
results  Illustrate  the  operator  representation  of  the  parameter  sensiti- 
vities which  was  presented  In  Section  II  for  the  Hilbert  space  linear 

system  S . Furthermore,  these  results  form  the  foundation  for  our  sub- 
H 

sequent  discussion  of  the  sensitivity-related  system  properties  In 
Sections  IV  - VII. 

The  organization  of  this  section  Is  the  following:  In  Section  III.l 
operator  expressions  are  derived  for  the  parameter  sensitivities  In  time- 
varying  linear  systems.  The  key  to  this  operator  representation  Is  to 
obtain  the  partial  derivatives  of  the  state  transition  matrix  with 
respect  to  the  parameter  components.  A general  theorem  concerning  these 
partial  derivatives  Is  proved;  however,  for  systems  with  a time-varying 
plant  matrix  the  computational  aspects  of  the  operator  representation 
are  less  desirable  than  those  for  computing  the  sensitivities  from 
sensitivity  system  differential  equation  techniques.  In  Section  III. 2 
operator  expressions  are  derived  for  the  parameter  sensltlvlj^^les  In 
linear  time-invariant  systems.  Again  the  key  factor  In  this  representa- 
tion Is  taking  the  partial  derivatives  of  the  state  transition  matrix, 
e^^.  General  polynomial  expressions  for  these  partial  derivatives  are 

obtained  and  useful  relationships  to  eigenvalue  sensitivities  are  shown. 

At 

This  polynomial  description  of  the  partial  derivatives  of  e is 
used  to  provide  a fundamental  matrix-operator  representation  of  the 
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parameter  sensitivities  In  linear  time 'Invariant  systems.  This  form 
proves  to  be  computationally  useful  and  also  provides  considerable 
geometric  Insight  for  the  study  of  various  sensitivity  related  system 
properties.  These  computational  and  geometric  features  are  examined 
In  Sections  IV  - VII. 

1.  TIME-VARYING  SYSTEMS 

In  this  section  we  derive  operator  expressions  for  the  parameter 

sensitivities  of  the  time-varying  linear  system,  S™  ' 

V 


x(t;u)  * A(t)x(t;u)  + B(t)u(t)  (84) 

x(t^)  = d t e [t^,  t^]  (85) 

x(t;u)  E R*'  u(*)  e L2(t^.  t^;R’')  (86) 

with  observable  output 

y(t;u)  = C(t)x(t;u)  (87) 

y(t;u)  e r"  t e [t^,  t^]  (88) 


where  the  matrices  A,  B,  C and  the  Initial  condition  vector  d are  assumed 

to  be  real  and  parameterized  by  the  real  p-dlmensioned  parameter  vector  b 

with  nominal  value  b e R^.  Again  we  will  observe  the  convention  of  not 
o 

showing  the  explicit  dependency  upon  b unless  It  is  required"  for  clarity 

and  It  will  always  be  assumed  that  quantities  are  evaluated  at  b^  unless 

Indicated  otherwise.  For  all  t c [t  , t,]  and  at  b It  Is  assumed  that 

o t o 

A(t),  B(t),  C(t),  and  d,  are  each  continuously  differentiable  with 
respect  to  every  parameter  component,  b^^,  1 “ 1,  2,  ...p.  Furthermore, 
for  all  b e R^  within  some  neighborhood  of  b^  e R^,  It  Is  assumed  that 
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A(.;b),  and  C(.;b)  and  all  p of  their  partial  derivatives  are 

uniformly  bounded  and  piecewise  continuous  on  the  finite  Interval 

[t©.  tj]. 

Under  these  assumptions  (Ref  29)  the  sensitivities  satisfy  the 
following  linear  "sensitivity  system": 


X(t;u)  “ A(t)X(t;u)  + B(t)u(t) 

X(t  ;u)  - d t e [t  , t,] 

O ox 

Y(t;u)  = C(t)X(t;u) 


where 


X(t;u) 


x(t;u) 

C^^^(t;u) 


Y(t;u) 


y(t;u) 

v^^^(t;u) 


v^^\t  ;u) 


A(t)  - 


rA(t) 

0 

. . 0 

B(t) 

> 

ft 

A(t) 

• 

B(t)  - 

• 

• 

• 

a 

0 

A(t) 

• 

ft 

O 

• 

a 

a 

o 

1 

C(t)  - 

C(i)(t)  C(t) 

a a a 

d “ 

‘*(1) 

a 

a 

a a a 

C^pj(t)  0 ...  C(t) 

1 

c. 

a W 

^ 

(89) 

(90) 

(91) 


(92) 


(93) 


(94) 
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and  5^^^(t;u)  and  v^^^(t;u),  i ■ 1,  2,  ...p,  are  the  state  and  output 
sensitivities  respectively.  Therefore,  the  solution  of  the  sensitivity 
system  requires  solving  n(p  + 1)  coupled  linear  differential  equations. 

However,  we  wish  to  obtain  an  operator  solution  of  the  parameter 
sensitivities  as  described  In  Section  II. 1.  Therefore,  consider  the 
operator  solution  of  the  system  S^: 


x(t;u)  « ♦(t,t  )d  + / 'I>(t,s)B(s)u(s)ds 
° t 

o 


y(t;u)  » C(t)  x(t;u) 

- T(t)d  + W(t)u 


(95) 


(96) 


where  4>(t,t^)  Is  the  state  transition  matrix  of  A(»)  and  the  zero-input 

and  zero-state  operators  T(t)  e L (r”,R™)  and  W(t)  e L-(L  (t  ,t;R'^),R'°) 

c z c o 

are  defined  by 


T(t)  - C(t)$(t,t^) 


W(t)  = C(t)4(t,s)B(s)  (Ods 

t 

o 


(97) 

(98) 


respectively.  For  discussion  we  will  deal  almost  exclusively  with  the 
output  sensitivity  operators,  and  these  are  given  by  the  operator 
expressions 


(99) 


(100) 


To  compute  the  partial  derivatives  of  T(t)  and  W(t)  requires  th/.t 
the  partial  derivatives  of  the  state  transition  matrix,  'l>(t,s),  be  ob- 
tained. The  next  theorem  establishes  the  existence  and  continuity  tf 
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these  partial  derivatives.  We  comment  that  this  theorem  is  a well-knovm 


result  for  the  case  in  which  A(*)  is  time- invariant  (Bellman  (Ref  2)) 


however,  our  proof  for  this  time-varying  case  is  different  than  the  one 


used  by  Bellman,  and  so  the  proof  is  instructive  in  its  own  right 


Theorem  1 


Suppose  for  all  t e [t  , t,]  at  the  nominal  parameter  value 


,nxn 


is  continously  differentiable  with  respect^  to  each 


parameter  component,  b , 1 » 1,  2 


neighborhood  of  b 


uniformly  bounded  and  piecewise  continuous  in  t.  Next,  suppose  't’(t,s) 


is  the  state  transition  matrix  of  A(t);  that  is,  for  all  t,  s e [t  , t,] 


4>(t,s)  is  the  unique  solution  of  the  matrix  differential  equation 


Then  for  each  t,  s e [t  , t,]  the  partial  derivatives  '!>^,>(t,s) 


p,  may  be  obtained  either  from  the  integral  equation 


(t,s)  - <Kt,s)  / ♦(s,C)A,..  (f;)4'(C,8)dC 


or  the  matrix  differential  equation 


(s,s)  “ 0.  Furthermore,  these  partial 


with  boundary  condition  $ 


derivatives  are  continuous  in  b 


Proof 


Let  A (t)  denote  A(t;b  + Ab.e.)  where  e,  is  the  1 Euclidean 


basis  vector  and  '!>.  (t,s)  denote  the  unique  solution  of  the  linear 


matrix  differential  equation 
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~ ♦ (t,s)  - A.  (t)*  (t,8) 

dt  A^ 


(105) 


9.  (s.s)  - I 


(106) 


Assuming  that  b + Ab.e.  Is  contained  In  the  neighborhood  In  which 
O 11 

A(t;b)  Is  uniformly  bounded  and  piecewise  continuous,  then  both  4(t,s) 
and  (t,s)  will  be  uniformly  bounded  for  all  t,  s e [t^,  t^].  The 
differential  equation  for  (t,s)  mav  be  written  as 

^ ♦^^(t.s)  - A(t)*^^(t,s)  + (A^^(t)  - A(t))*^  (t.s)  (107) 


which  has  the  unique  solution 


(t,s)  - <Kt,s)  + ♦(t.s)  /‘•♦(s.OCA.  (O  - A(c))^.  (C,8)d!;  (108) 
^1  s ^1  ^1 

Using  the  Bounded  Con^{;ergence  Theorea  (Ref  25:  81)  and  the  fact  that 
A(.;b)  Is  continuous  In  b at  b^,  the  limit  of  both  sides  of  equation 
(108)  gives 


|A0j  - 0 ^ 


♦ (t.s)  - ♦(C’.s) 

A. 


(109) 


Hence.  ♦Tt.s)  Is  also  continuous  In  b at  b . 

o 

since  the  partial  derivatives  of  A(t;b)  are  uniformly  bounded  In  a 
neighborhood  by  b^.  we  may  use  equation  (106)  and  again  apply  the 
Bounded  Convergence  Theorem  to  the  definition  of  the  partial  derivative 


to  obtain 


♦(l)(t.8)  = 11m 


♦ (t.s)  - ♦(t.s) 


lAb^l  ^ 0 


- ♦(t.s)  /’'♦(s.i:)A^j(04'(c.8)di: 

6 

56 


(110) 


AFAL-TR-76-118 


Since  assumed  continuous  In  b,  all  the  factors  on  the  right 

hand  side  of  equation  (110)  are  continuous  In  b.  Thus  also 

continuous  In  b at  b . 

o 

Finally,  the  differential  equation  representation  of 
follows  by  application  of  Leibnitz  Rule  to  equation  (103) . 

Q.E.D. 

Because  of  the  uniform  bounds  assumed  on  the  partial  derivatives  of 
A(*)  and  B(«)>  the  partial  derivative  may  also  be  brought  Inside  the 
Integral  In  equation  (98).  Therefore,  the  partial  derivatives  of  the 
operators  T(t)  and  W(t),  equations  (97)  and  (98),  may  be  described  In 
terms  of  the  partial  derivatives  of  C(t),  B(t),  and  $(t,t^)  where  the 
latter  partial  derivative  Is  computed  from  Theorem  1.  These  may  then 
be  used  In  equations  (99)  and  (100)  to  give  an  operator  expression  for 
the  zero-input  and  zero-state  sensitivities.  However,  because  of  the 
convolution  In  equatloVi  (98),  this  Is  not  a particularly  practical  way 
for  computing  the  parameter  sensitivities  for  this  time-varying  plant 
matrix  case.  For  the  time-invariant  system  considered  In  the  next  sec- 


tion, these  undesirable  convolutions  may  be  transformed  Into  computa- 
tionally efficient  quadratures. 


2.  TIME-INVARIANT  SYSTEMS 

In  this  section  we  assume  that  the  A,  B,  and  C matrices  In  equa- 
tions (84)  and  (87)  are  time- invariant.  Also,  without  loss  In  generality 
we  will  assume  that  t^  ••  0.  This  time-invariant  ordinary  differential 
equation  system  will  be  denoted  s^q.  Like  the  tlme-var’-*ng  system  of 
the  previous  subsection,  the  key  to  obtaining  a practical  operator  form 

of  the  parameter  sensitivities  will  be  an  efficient  way  to  compute  the 

At 

partial  derivatives  of  the  state  transition  matrix,  ♦(t)  ••  e . Theorem  1 
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could  be  utilized;  however,  for  the  time- Invar lent  A matrix  we  are  able 
to  obtain  far  more  fundamental  and  useful  forms  of  these  partial 
derivatives. 


At 

a.  Mathematical  Description  of  e^^j  “ <l>^^j(t) 

Consider  the  system  matrix  A e r"”*  at  the  a priori  parameter  value 

b e R^.  The  following  notation  will  be  used  throughout  this  section, 
o 

Suppose  that  A has  characteristic  polynomial 


A(q)  - det  (ql  - A)  » II  (q  - q^) 


and  u order  minimal  polynomial 


ij)(q)  - n (q  - qj^) 


(111) 


(112) 


where  q^,  k ■ 1,  2,  ...p,  are  the  distinct  eigenvalues  of  A; 

^ ^ 1 are  the  multiplicities  of  q^^  in  the  characteristic  and  minimal 

P P 

polynomials  respectively;  and  p » E p.  and  n * Z n.  . 

k-1  k-1  ^ 

The  system  state  transition  matrix  may  then  be  represented  as 


*(t)  - e**"  - E E 
k-1  j-0 


(113) 


where  the  n x n matrices  Z.  , are  termed  the  components  of  M (Ref  33: 

J 


604).  An  equivalent  representation  Is  (Ref  33:  609) 


i-1 

r a3  aj(t) 


where  the  real,  scalar,  linearly  independent  functions,  aj(‘)» 

J - 1,  2,  ...p,  are  uniquely  determined  from  the  p linear  equations 


(114) 
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V ri  2 w-l, 

t-'e  - [1  q q ...  q 


aj(t) 

OjCt) 


(115) 


a (t) 

W 


Ic*  Xy  2|  •••p 
j - 0,  1,  ...Wj^  - 1 


We  shall  presently  determine  representations  for  the  partial  derivative 
At 

e^^^  which  are  equivalent  to  each  of  the  above  forms. 

Now  consider  the  augmented  "sensitivity  system": 


x(t;u)  A 0 x(t;u)  | jB  j 

C^^^(t;u)  A^^j  A e^^^t;u)J 

’x(O)  d ^ 

e'‘>(o)J  ■ [d,j, 


u(t)  (116) 


(117) 


The  augmented  matrix 


A^i)  A 


(118) 


is  of  fuhdamental  Importance  In  describing  the  operator  solution  of  this 
1^^  sensitivity  system,  (116),  and  we  shall  obtain  several  useful 
properties  concerning  parameter  sensitivity.  In  general,  by  considera- 
tion of  this  augmented  matrix.  Indeed,  the  first  theorem  provides  the 
foundation  for  all  of  our  remaining  results  on  parameter  sensitivity  for 
the  linear  tlrae-lnvarlant  system, 


Theorem  2 


Suppose  A c R Is  continuously  differentiable  with  respect  to 
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b.  At  b . Then 
X o 


At 

®(1) 


-At 


(119) 


Proof 


The  state  transition  matrix  for  A^  may  be  uniquely  determined  from 


d 1 
dF* 


-V 


. At 
■ A^e 

(120) 

into 

four  n X 

: n blocks 

9ll(t) 

♦l2(t)' 

(121) 

92i(t) 

*22^^^ 

it  Is  easy  to  use  (120)  to  show  that 


*11^*'^  " It  “ It 


(122) 


*22 ® ^ ^ 


(123) 


and 


^(l)*^’"'  *21^°^  “ 

Then  from  Theorem  1,  *21^^^  “ *(1)^^^*  1®  complete. 

Q. E .D . 

Theorem  2 establishes  the  basic  relationship  from  which  we  obtain 

At 

various  representations  for  the  partial  derivatives,  shows  the 

fundamental  Importance  of  the  augmented  matrix  A^  In  the  determination 
of  the  1*^^  partial  derivative  of  e^^.  Examining  this  augmented  matrix 
we  see  that  Its  characteristic  polynomial  Is 
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ql  - A 

0 

A.(q)  - det 

* -A 

*(1) 

ql  - A 

9 P 

■ 

> 

1 

1 

•o 

- ‘‘k^ 

(125) 


Thus  the  eigenvalues  of  are  the  same  as  those  of  A and  this  is  true 
for  each  parameter  component  1 ■ 1,  2,  ...p.  Of  course  the  minimal 
polynomials  of  the  A^'s  may  be  different  for  each  parameter  component, 
bj^,  and  so  let  us  suppose  that  A^^  has  a - order  minimal  polynomial 


P “1^ 

- n (q  - qj^) 
k-1 


(126) 


We  shall  investigate  the  eigenvalue  multiplicities,  p.  , k ■ 1,  2,  ...p, 

\ 

further,  but  for  now  the  above  notation  Is  used  to  give  the  following 
result: 


Corollary  2.1 


P 

At  ^ - 

®(1)  " ^ ^ \ 
k-1  j-0 


(127) 


where 


-C] 


(128) 


2n  X n 


and  Z are  the  components  of  A, . 

Vj 

Proof 

This  Is  an  Immediate  consequence  of  Theorem  2. 

Q.E.D. 

Corollary  2.1  Is  the  primary  representation  which  will  be  used  In 

our  computational  applications,  and  Appendix  A discusses  some  computa- 
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tlonal  techniques  for  determining  the  matrices  V.  . However,  for 
investigation  of  some  of  the  geometric  properties  of  the  sensitivity 
operators,  the  next  corollary  will  find  principle  application. 


Corollary  2.2 


2n,  are  determined  from 


The  corollary  follows  Immediately  from  Theorem  2 upon  using  the 
well-known  result  that  (e.g.,  (Ref  51)) 


Before  considering  the  application  of  Corollary  2.2  to  the  defining 
relations  for  the  sensitivity  operators,  some  fundamental  Information 
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concerning  the  partial  derivatives  e^^  can  be  gained  by  examining  the 
minimal  polynomials  of  A^. 

b.  The  Minimal  Polynomial  of  A^  - General  Case 

The  following  lemma  summarizes  a useful  way  to  determine  the  minimal 
polynomial  of  A^.  Notice  that  the  only  assumption  on  the  A mairlx  for 
this  lemma  Is  that  the  partial  derivative  exist.  Indeed,  one  could 

substitute  the  matrix  with  any  n x n matrix  and  the  conclusions  of 
the  lemma  would  still  remain  valid. 

Lemma  6 

Let  A(q)  and  t{)(q)  be  the  characteristic  and  minimal  polynomials, 
respectively,  of  A as  defined  by  equations  (111)  and  (112). 

I)  Suppose  that  g(q)  Is  the  polynomial  such  that  A(q)  ■ <('(q)g(<l), 
and  define  the  reduced  adjunct  matrix^ 

IdJ  Iql  - A]  5 adj  [ql  - Al/g(q)  (133) 

Let  g^(q)  be  the  largest  common  factor  of  both  adjlql  - AjA^^adj  (ql  - A] 
and  the  minimal  polynomial  i|/(q).  Then  i{i^(q)  ■ (q)/gj^(q)  Is  the  minimal 

polynomial  of  A^,  equation  (118). 

II)  The  order  of  <{'^(q)  Is  between  the  bounds 

W 1 £ 2u  (134) 

and  each  eigenvalue  multiplicity  has  similar  bounds 

\ 1 i*!  < 2u  . (135) 

k 


^Following  Nering  (Ref  21:  95)  we  term  the  "adjunct  matrix"  of  A as  the 

matrix  adj  (A)  " (Aij)^  where  Ajj  Is  the  cofactor  of  the  element  a^.  of  A. 
The  adjunct  matrix  Is  also  commonly  referred  to  as  the  "adjoint"  matrix, 
and  we  use  this  terminology  to  avoid  confusion  with  the  adjoint  operator. 
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Proof 

Part  1)  Is  derived  by  considering  Che  2n  x 2n  partitioned  adjunct 
matrix 

adj 


ql  - A 

° 1- 

. r-u 

(136) 

■^(1) 

ql  - aJ 

*-‘*12  ^22- 

Using  the  fact  that  for  <1  ^ 


Iql  - Al  adj  tql  - A]  - A(q)I 
one  may  form  a set  of  four  linear  equations  from 


ql  - A 0 1 

r**!!  **21 

-A^^j  ql  - aJ 

L<1i2  ^22 

A^q) 


[:  :i 


(137) 


(138) 


and  show  chat 

ql  - A 0 


adj 


[ql  - A 0 . 1 

-A^^j  ql-Aj 


A(q)  adj  [ql  - A)  0 

adj  Iql  - A]  A^j^  adj  Iql  - A]  A(q)  adj  Iql  - A] 


] 


(139) 


Mow  g(q)  is  the  largest  common  factor  of  every  term  of  adj  [ql  - A] 

(Ref  21:  101),  and  so  the  largest  common  factor  of  adj  [ql^  A^]  will 

2 

be  g (q)g^(q)  where  g^(q)  la  as  described  In  the  lemma.  Then  the 
minimal  polynomial  of  la 

♦j(q)  - A^(q)/g^(q)g^(q)  - i|>^(q)/gj^(q)  (140) 

Part  11)  of  Che  lemma  then  follows  immediately  from  equation  (140) 


and  the  definition  of  g^(q). 
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Q«E»D* 


The  minimal  polynomial  of  the  augmented  matrix  has  been  considered 

previously  by  Cuardabassl,  Locatelll,  and  Rinaldi  (Ref  51).  By  examining 

the  Jordan  canonical  form  of  A^  they  obtain  similar  bounds  on  the  order  and 

multiplicities  of  the  minimal  polynomial^.  They  obtain  slightly  tighter 

bounds  on  the  multiplicities  of  the  eigenvalues  (they  conclude  that 

p < 2u.  -1+6,  where  u.  Is  the  eigenvalue  multiplicity  q.  and 

6,  Is  the  Kronecker  delta);  however,  the  results  of  Lemma  6 are  completely 
^’*'k 

general  requiring  no  assumptions  concerning  the  parameters  In  A other  than 
the  existence  of  the  partial  derivatives,  whereas  Guardabassl,  et  al,  place 
the  restriction  on  the  structure  of  A that  the  dimension  of  Its  Jordan 
blocks  and  number  of  distinct  eigenvalues  ate  Invariant  In  a neighborhood 
of  the  nominal  parameter  value,  b^.  The  following  simple  example  Illustrates 
how  Lemma  6 might  be  utilized  and  shows  that  the  restrictions  assumed  by 
Guardabassl,  et  al,  are  Indeed  necessary  In  order  to  obtain  their  slightly 
tighter  bounds.  Also,  this  example  will  provide  Insight  Into  our  next 
subsection  In  which  we  consider  a special  case  similar  to  the  one  which 
Guardabassl,  et  al,  considered. 


Example 


Suppose  that  A(b)  Is  the  A x 4 matrix 


10  0 


bj^  1 0 0 

0 0 1b, 

.0  0 0 1 


^Actually  reference  51  considers  the  minimal  polynomial  of  A,  equation  (6); 
however.  Lemma  6 Is  easily  extended  to  this  case  and  part  11)  of  the  lemma 
remains  true  regardless. 
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where  the  nominal  values  of  and  are  zero.  Then  A(q)  = (q  ~ 1)^ 

2 

and  = (q  - 1)  . The  adjunct  matrix  of  [ql  - A]  is  readily  computed 

to  be 


adj  [ql  - A]  “ (q  - 1)'^  (q  _ i)  i 


(q  - 1)  0 


(q  - 1)  0 


(q  - 1) 

» 


(142) 


The  partial  derivatives  of  A are 


0 0 0 0 


0 0 0 0 


10  0 0 


0 0 0 0 


0 0 0 0 


0 0 0 1 


0 0 0 0 


0 0 0 0 


and  so 


adj  [ql  - AjA^^adJ  [ql  - A]  = 


(q  - 1) 


0 0 


(q  - 1)  (q  - 1)  0 0 
0 0 0 0 


0 0 


adj  [ql  - A]A^2)3dj  [ql  - A] 


0 0 0 0 

0 0 0 0 

0 0 0 (q  - D' 

0 0 0 0 


Therefore,  by  lemma  6 the  minimal  polynomial  of  A^  is 

^j(q)  - >^(q)/l  = (q  - 
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and  of  A^  Is 

412 (<j)  “ 4'^(q)/(q  - 1)^  * (q  - i)^  » <l'(q).  (147) 

- 2 

Thus,  this  example  has  the  two  extremes  In  which  'I'j^Cq)  “ 4 (q)  and 
4/2 (q)  ^ ^(q)-  Notice  that  any  value  of  other  than  zero  produces  the 
characteristic  polynomial 

4^,^(q)  = (q  - l)^(q^  - 2q  + 1 - bj)  (148) 

and  minimal  polynomial 

4’.  (q)  “ (q  - l)(q^  - 2q  + 1 - b.)  (149) 

“1  ^ 

whereas  a perturbation  of  82  has  no  effect  on  the  minimal  polynomial  of  A. 

c.  Special  Case  - Invariance  of  the  Minimal  Polynomial  of  A 
Assume  that  A has  the  property  that  the  structure  of  Its  minimal 
polynomial  is  Invariant  for  all  parameter  component  perturbations  in  a 
neighborhood  of  the  nominal  b^.  There  is  little  loss  In  generality  in 
this  case  from  that  of  the  previous  section,  as  it  happens  very  seldom  in 
practice  that  a local  parameter  perturbation  will  change  the  structure  of 
the  minimal  polynomial.  Also,  this  case  is  slightly  more  general  than 
the  case  considered  by  Guardabassi,  et  al,  (Ref  51),  as  the  structure  of 
the  Jordan  blocks  of  a matrix  can  change  without  affecting  ^e  minimal 
polynomial  but  not  vice  versa.  The  following  theorem  summaries  a funda- 
mental relationship  between  the  minimal  polynomial  of  A^  and  the  b^^- 
clgenvaluc  sensitivities  for  this  special  case: 

Theorem  3 

Suppose  that  A has  p^^-ordor  minimal  polynomial,  4/(q),  equation  (112); 
A is  continuously  differentiable  with  respect  to  at  b^;  and  for  all 
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lAbj^I  £ e^,  A^  ■ ^ minimal  polynomial 


^ ^1 
(q)  =»  n (q  - qj^^) 

^ k-1 


where  q^^  ■*■  q^^  as  [Ab^l  -►  0,  k » 1,  2,  ...P. 
Then: 


At  ^ ‘*k*= 

=“)■  ' '\.0>(1)" 

k-1 


P 

k-1  J-l 


J.v 


“ \ V 

* ^ \.p,-iV<i)^  ' 
k-1 


where  ^ are  the  components  of  A and  (Zj^  j^(l)  ^^'k^l) 

component  and  eigenvalue  sensitivities  (partial  derivatives) 
to  b^.  These  partial  derivatives  exist  and  are  continuous. 
11)  The  minimal  polynomial  of  the  augmented  matrix  A. 


ipj^Cq)  - n (q  - qj^) 
k-1 


where 


Mk  + 1 If  (qi,)^)  * 0 


ill)  The  order  of  the  minimal  polynomial  of  A^  is  u + 
Is  the  number  of  eigenvalues  of  A which  have  a non-zero  b 


(150) 


(151) 

are  the 
with  respect 

Is 

(152) 

(153) 

Bj^  where 
^ eigenvalue 


sensitivity. 
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Proof 


Prom  the  component  representation  of  e^^  we  have 


At 


P 

ml  £ 

k-1  J-0 


‘‘k*=  1 
"k.j®  " ■ 


(15A) 


Similarly,  by  the  assumption  on  A 

1 ^4  i 

*1  ’k  ' 


A . t 

^1  ^ 
e - E 


k-1  J-0 


It  Is  true  that 
t^ 


(155) 


“4 

where  Z.  . are  the  component  matrices  of  A « Then  by  the  definition  of 

K,i 

the  partial  derivative,  we  obtain: 


P 

Z 

k-1 


Z 11m 
J-0  lAb^l 


^1 

Ab. 


(156) 


Since  A Is  continuously  differentiable  with  respect  to  b^^,  by  Theorem  1 
At 

It  Is  true  that  e^^^  exists  and  Is  continuous.  Thus  the  limits  on  the 
right  hand  side  of  equation  (156)  must  also  exist  and  be  continuous. 

By  proper  manipulation  expression  (151)  readily  follows.  Then  since 
the  compone  :s  Z.  ^ cannot  be  zero  by  definition  of  the  minimal 
polynomial,  parts  11)  and  111)  follow  Immediately  from  1). 

Q .£ .D . 

The  conclusions  of  Theorem  3 are  an  extension  to  those  of  Guardabassl, 
et  al  (Ref  51)  (Ref  52).  In  (Ref  51)  upper  bounds  on  the  multiplicities 
of  the  eigenvalues  of  the  augmented  matrix  A,  equation  (7),  are  obtained 
(see  the  comment  following  Lemma  6).  Then  In  (Ref  52)  the  special  case 
In  which  there  Is  no  repeated  eigenvalues  In  the  A matrix  Is  analyzed  In 
detail.  The  detailed  structure  of  the  Jordan  canonical  form  of  A Is 
given,  and  the  structure  of  these  blocks  Is  dependent  upon  the  eigenvalue 


sensitivities.  In  both  papers  the  conclusions  are  reached  through  an 
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analysis  of  the  Jordan  canonical  form  of  A.  Thus,  Theorem  3 extends  the 
conclusions  of  these  pre' lous  papers  by  obtaining  definitive  relation- 
ships between  the  eigenvalue  sensitivities  and  the  minimal  polynomial  of 
the  augmented  matrix  (the  extension  to  the  minimal  polynomial  of  A 
Is  straight-forward  - see  Corollary  3,1),  and  this  Is  accomplished  for 
the  more  general  case  (repeated  eigenvalues).  Also,  it  Is  significant 
to  note  that  the  approach  taken  in  deriving  Theorem  3 Is  considerably 

different  from  that  of  References  51  and  52,  In  that  a direct  tflf ferentla- 
At 

tlon  of  e Is  utilized  rather  than  the  Jordan  canonical  form  analysis 
used  by  Guardabassl,  et  al. 

Comparing  equation  (127)  of  Corollary  2.1  with  equation  (151)  of 


Theorem  3 we  see  that 


Since  the  n x n matrices  V and  Z,  . may  be  determined  independ- 

ently  from  the  components  of  the  matrices  and  A,  respectively,  the  rela- 
tion (165)  provides  a very  general  and  straight-forward  means  for  deter- 
mining the  eigenvalue  sensitivity  Itself.  Most  of  the  existing 

literature  on  eigenvalue  sensitivity  has  concentrated  on  the  special  case 
In  which  A has  distinct  eigenvalues  or  the  parameter  appears  linearly  in 
one  or  more  elements  of  the  A matrix  (Ref  43)  (Ref  44).  The  relation 
(157)  places  only  the  restrictions  assumed  by  Theorem  3^-  We  note  that 
Guardabassl,  et  al,  also  derive  a general  expression  for  computing  the 
eigenvalue  sensitivities  based  upon  the  Jordan  blocks  of  the  canonical 
form  of  A. 

Next,  if  the  conditions  of  Theorem  3 arc  satisfied  for  all 


bj,  1 " 1,  2,  ...p,  and  if  we  let  6 p be  the  number  of  eigenvalues 


J 
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of  A which  have  a non-zero  sensitivity  for  at  least  one  h . , 1 “ 1,  2 


p It  Is  possible  to  represent  e 


order  polynomial  as  given  by  the  next  corollary 


Corollary  2.3 


Assume  the  conditions  of  Theorem  3 are  true  for  all  parameter 


Let  6 be  the  number  of  eigenvalues  which 


components 


have  at  least  one  non-zero  parameter  sensitivity  for  b 


1)  The  minimal  polynomial  of  the  n(p  + 1)  x n(p  + 1)  augmented 


matrix  A (defined  in  equation  (7))  Is 


where 


where  the  real  scalar  functions  «.(•).  J “ 1>  2 


are  determined 
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Proof 


Extending  Theorem  2 It  can  readily  be  shown  that 


At 


-At 


At 

®(1) 


At 

^P) 


At 


0 

0 


At 


(161) 


ln(p  + l)x  n(p  + 1) 


If  A has  the  minimal  polynomial 


At 


P 

n 

k-l 


n (q  - qj^) 


(162) 


then  e has  the  unique  component  representation 


At 


P 

Z 


jV- 

Z t^e  Z 


k-l  J-0 


k.j’ 


(163) 


where  the  component  matrices  Z.  . are  partitioned  so  that  the  relationship 

— » j 

(161)  is  satisfied.  But  by  this  relationship  and  the  assumptions  of  the 
corollary  it  must  be  true  that  for  any  particular  eigenvalue,  qj^, 
k-l,  2,  ...p,  the  highest  power  of  t in  the  expression  (163)  will  be 
either  - 1 if  (qy^)  “ 1?  2,  ...p  oi  it  will  be 

otherwise.  Thus  the  conclusions  of  part  1)  readily  follow. ~ 

Fart  11)  is  then  merely  an  application  of  the  conclusions  of  1). 

Q.E.D. 

Thus  Corollary  2.3  may  be  used  in  lieu  of  Corollary  2.2,  and 
the  number  of  linear  equations  which  need  to  be  solved  (to  determine  the 
scalar  functions  SjC*)  In  tbe  former  case  and  the  functions  Oj(‘)  in  the 
latter)  will  be  reduced  from  2n  to  p.  If  there  are  many  repeated  elgen- 
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values,  then  the  difference  could  be  significant,  particularly  since 
solving  the  set  of  required  linear  equations  will  generally  Involve  the 
Inversion  of  a generalized  Vandermonde  matrix  of  the  corresponding 
dimension.  However,  the  use  of  Corollary  2.3  requires  considerably  more 
structural  knowledge  than  Corollary  2.2  and  this  structural  knowledge 
probably  far  outweighs  the  benefits  of  the  reduced  order  Vandermonde 
matrix. 

We  conclude  this  subsection  with  a simple  example  to  help  ^tlvate 

ii 

and  clarify  the  concepts  which  have  been  discussed. 
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the  partial  derivatives  In  the  form  of  Corollary  2.1  as  follows 


At 

^3) 


V,  + V • t + V e^  + V te'^ 

^1,0  ^1,1  2,0  ■’2,1 


Then  using  equation  (157)  to  give  the  expressions  that 


■ ‘’I’d)  ^1,0 

■ '"z’d)  ^2,0 


we  see  that  the  eigenvalue  sensitivities  are  given  by 
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'‘*2^(3)  “ ^ 


Then  since  both  and  have  a non-zero  sensitivity  for  at  least  one 
parameter  component,  b^,  b^,  or  b^,  the  minimal  polynomial  of  A is  given 


♦ (q)  * (q  - (q  - - q^(q  - 1)^. 


(176) 


Thus  the  order  of  the  minimal  polynomial  ofAisp“u+2“A. 


d.  Parameter  Sensitivity  Operators  for  System  Sj^^ 


In  the  previous  sections  a number  of  basic  mathematical  representa- 


tions of  the  partial  derivatives  e^^  were  developed.  These  will  now  be 


used  to  provide  a convenient  operator  description  of  the  sensitivities 


for  the  system  S In  particular,  by  directly  utilizing  equations  (96) 


(100)  and  Corollary  2.2  we  obtain  the  fundamental  algebraic  description 


of  these  parameter  sensitivity  operators*; 


Theorem  4 


For  the  system  S - at  the  a priori  parameter  value  b e it  is 
LL  O 


true  that 


y,  < ^ CA-*  da  (t) 

z.  X.  j 


(177) 


y_  _ Ct;u)"'  I CA  a,(t  - s)u(8)ds 

z . s.  0 J 


(178) 


y(t;u)-y  . (t)  + y,  „ (t;u) 

ZeO* 


'Note  that  Corollary  2.4  or  any  of  the  other  equivalent  polynomial 
representations  of  could  likewise  be  utilized  here. 


(180) 
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11)  (t)  - r (CA^"^d)(^jaj(t) 

J-1 

2n 

(t;u)-  t (CA^”^B),.v  • a (t  - 8)u(8)d8  (181) 

V*/  0 J 

i-1 

(t;u)-vf^J  (t)  + (t;u)  (182) 

ZaXa  ZaSa 


1 > 1,  2,  . . .p 

where  the  scalar  functions,  aj(t),  J ~ 1,  2,  ...2n,  are  uniquely  deter- 
mined by  the  2n  linear  equations  (131). 

Proof 

The  theorem  Is  a direct  application  of  Corollary  2.2  to  the 
defining  equations  (96)  - (110). 

Q . E .D . 

Theorem  4 provides  a useful  representation  of  the  parameter 
sensitivity  operators  for  the  linear  time-invariant  system  The 

geometric  aspects  of  this  description  are  examined  In  the  next  section 
In  conjunction  with  the  systems  properties  of  Insensitivity,  sensiti- 
vity controllability,  and  Identlflablllty.  Theorem  4 Is  also  a 
convenient  form  for  digital  computer  applications,  and  some  of  the 
computational  aspects  of  this  description  are  examined  In  Appendix  A. 
Note  that  the  entire  set  of  first  order  sensitivities  may  be  obtained 
with  at  most  2nr  ordinary  Integrals  (Appendix  A describes  how  the 
convolutions  Indicated  In  equations  (178)  and  (181)  may  be  transformed 
Into  time  varying  combinations  of  quadrature  Integrals).  The  computa- 
tional utility  of  this  representation  will  be  seen  even  more  clearly 

when  It  Is  used  to  form  the  basis  of  algorithms  for  quaslllnearlzatlon 
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parameter  identification  (Section  V),  minimum  aensltlvlty  control  design 
(Section  VI),  and  optimal  sensitivity  design  for  Improved  Identification 
(Section  VII).  Finally,  ve  note  that  a similar  polynomial  description 
for  the  second  and  higher  order  sensitivities  may  easily  be  obtained, 

,and  a discussion  of  this  Is  presented  In  Appendix  B. 

3.  SUMMARY 

In  this  section  we  have  used  an  operator  description  of  the  system 
output  to  obtain  the  sensitivity  operators  for  linear  ordinary  differential 
equations  systems  - both  time-varying  and  time- Invariant  ones.  These 
results  illustrate  the  operator  approach  used  In  Section  II  for  the  Hilbert 
space  system  S . The  key  to  making  the  operator  approach  practical  Is 
obtaining  the  partial  derivatives  of  the  state  transition  matrix  with 
respect  to  the  parameter  components.  Expressions  for  these  partial 
derivatives  are  derived  for  the  time-varying  systim  S^  (Theorem  1); 
however,  for  merely  computing  the  output  sensitivity  operators  their  use 
Is  probable  less  desirable  than  computing  the  sensitivity  operators  from 
the  "sensitivity  system"  differential  equations. 

For  the  tlme-lnvarlpnt  system  this  is  not  at  all  the  case. 

Here  polynomial  expressions  for  the  partial  derivatives  of  e^^  are 
derived  enabling  both  geometric  Insights  and  practical  computational 
techniques  to  be  obtained.  The  complete  system  output  and  all  p output 
parameter  sensitivity  operators  can  thus  be  obtained  through  matrix 
manipulations  and  the  solution  of  2nr  quadrature  Integrals  where  n is 
the  state  dimension  and  r Is  the  control  dimension.  If  mild  structural 
restrictions  on  the  minimal  polynomial  of  A are  assumed  in  a neighbor- 
hood of  the  nominal  parameter  vector,  then  explicit  relationships 
between  the  modes  of  the  sensitivity  operators  and  the  eigenvalue 

sensitivities  are  obtained.  These  expressions  also  result  in  a new 
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general  method  for  computing  the  eigenvalue  seusltlvltlea  themselves. 

These  structural  properties  of  the  sensitivity  operators  are  extensions 
of  those  obtained  previously  through  Jordan  canonical  form  analysis 
(Ref  51)  (Ref  52). 

In  the  next  four  sections  we  use  the  approach  of  Section  II  to 
examine  the  sensitivity-related  system  properties  of  insensitivity • 
sensitivity  controllability,  and  Identlf lability  (Section  IV)  and  to 
develop  computational  algorithms  for  quasll inear Izatlon  (Sectiofi  V), 
mininum  sensitivity  control  design  (Section  VI),  and  optimal  design 
of  the  sensitivity  operators  to  enhance  parameter  Identification 
(Section  VII).  We  concentrate  on  the  linear  time-invariant  system  and 
the  application  of  the  algebraic  representation  obtained  In  Section  III. 2. 
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Section  IV 


INSENSITIVITY,  SENSITIVITY  CONTROLLABILITY,  AND  IDENTIFIABILITY 


FOR  LINEAR  TIME- INVARIANT  SYSTEMS , S, 


eter  Insensitivity,  sensitivity  controllability  and  system  parameter 


Ident If lability  are  considered > The  basic  concepts  for  these  properties 


were  discussed  In  Sections  II. 3,  II. 4,  and  II. 7 for  the  Hilbert 


Our  reason  for  grouping  these  three  properties  In  this  section  Is  based 
upon  their  direct  geometric  reliance  on  the  algebraic  description  of  the 


tem  properties  are  Intimately  related  to  each  other,  and  these  relation- 


ships are  made  more  evident  by  the  algebraic  conditions  which  are  obtained 


1.  NOTATIONAL  CONVENTIONS 


where 
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g(t;u)  - a(t-8)*u(8)d8 

0 

(186) 

- [Cd(  CAd  j . . .| 

(187) 

- I(Cd)^j|  (CAd)(^j 

1 . . . 1 

(188) 

- [CB  1 CAB  j . • • 1 

(189) 

- [(CB)^^j  j (CAB)(j^^ 

1 . . . 1 , 

(190) 

Then  we  may  write  the  "total"  sensitivity  operators  as 

- [E^^^a(t)  E^^^a(t).  . .E^^\(t)] 

■ E*a(t) 

(191) 

and 

V (t;u)  H Iv^^^  (t;u). 

z.s. 

z.s.  mxp 

- [G^^^g(t;u). 

. .G^^^g(t;u)] 

- G*g(t;u) 

(192) 

where 

E - [E  E . . .E  iax2np 

(193) 

r . g(p), 

G - [G  G . , .G  J^2nrp 

(194: 

In  order  to  discuss  some  of  the  relationships  between  Insensitivity 
sensitivity  controllability,  and  Identlf lability.  It  will  be  helpful  to 
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' Am  m final  comment,  notice  that  ve  have  not  shown  the  explicit  de- 

pendency of  any  of  the  sensitivity  operators  or  matrices  upon  the  param- 

! eter  vector  b c Naturally,  If  these  matrices  and  sensitivities  are 

f 

to  be  computed,  they  must  be  evaluated. at  some  nominal  b^  e However, 

In  this  section  one  may  be  equally  concerned  with  the  system  properties 
of  Insensitivity,  controllability,  or  Identlf lability  for  Just  one  partic- 
ular value  b e R^  (a  local  condition)  or  for  nearly  all  b c R^  (a  'btruc- 
o 

tural"  condition^).  Because  we  obtain  algebraic  conditions  for  each  of 
these  three  system  properties,  these  conditions  may  be  made  loc'al  by 
evaluation  at  one  particular  b^  e R^  or  they  nay  be  made  "structural"  by 
leaving  the  appropriate  matrices  as  functions  of  b e R^.  Therefore,  we 
adopt  the  convention  that  theorems  and  definitions  will  all  be  stated 
without  regard  to  whether  they  are  local  or  structural;  It  the  conditions 


are  met  for  Just  one  b^  e R^  the  property  Is  local,  but  If  It  Is  true  for 
almost  all  b e R^  then  It  Is  structural.  I'he  later  simple  examples  of  this 
section  will  better  Illustrate  this  concept. 


2.  PARAMETER  INSENSITIVITY 

Using  the  previous  notation  and  the  fact  that  the  scalar  functions 
a^C*),  J * 1,  2,  ...2n  are  linearly  Independent  on  every  Interval  of 
finite  length  (and  hence  non-zero  almost  everywhere)  the  following  condi- 
tions for  Insensitivity  can  be  Immediately  obtained. 


Theorem  5 

The  system  S^^  is  zero-input  b^-component  parameter  Insensitive 
relative  to  the  output  (that  Is,  v^^]  (*)  ■ 0)  If  and  only  If 

Is  j 

(±)  J 

E'  ■ 0.  It  Is  totally  zero-input  Insensitive  (V  . (•)  ■ 0)  If 

J Z s Xs 

and  only  If  E ■ 0. 


^Gupta  and  Hall  (Ref  108)  Introduce  such  a "structural"  concept  In 
their  discussion  of  sensitivity  controllability. 
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Th«orein  6 

Th«  system  Is  zero-state  b^-component  parameter  Insensitive 
relative  to  the  output  (that  Is,  v^^^  (su)  • 0 for  all 

Z e 8 e J 

r ^ 

o(*)  c 12(0, t£;R  ))  If  and  only  If  - 0.  It  la  totally  zero-state 
Insensitive  If  and  only  If  G ■ 0. 

There  has  been  a significant  amount  of  research  Into  the  conditions 
for  parameter  Insensitivity  In  the  linear  time-invariant  system 
(See  Section  I.l.a.)  Indeed,  the  essential  results  of  Theorem  5 and  6 
have  been  derived  previously  by  "canonical  form  - minimal  order  sensitiv- 
ity system"  analysis,  (e.g.,  (Ref  61))  They  are  stated  here  mainly  to 
Illustrate  how  Theorem  A can  be  used  to  give  Immediate  geometric  condi- 
tions for  which  fairly  complicated  analysis  were  required  In  previous 
derivations.  However,  the  results  of  Theorems  5 and  0 are  new  In  that 
they  treat  zero-input  and  zero-state  Insensitivity  separately  rather  than 
jointly^,  and  they  add  the  generality  of  allowing  the  Initial  condition 
vector  to  be  dependent'  upon  the  unknown  parameter  vector,  b e R^. 

If  d Is  not  dependent  upon  b^,  then  Theorem  5 provides  the  means  to 
determine  the  largest  subspace  of  R°  from  which  an  Initial  condition 
vector  may  be  selected  In  order  to  ensure  that  v^^J  (•)  ■ 0; 


Corollary  5.1 

If  - 0.  th.n  v«>  ^(.) 
k-1 


0 If  and  only  If  d e 


where 


(201) 


To  Illustrate  the  application  of  Corollary  5.1,  one  might  utilize 


^Note  that  "zero-state  lasensltlvlty"  Is  what  Guardabassl,  et  al  (Ref  61) 
define  as  "hypo-lnsensltlvlty" 
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It  In  the  design  of  "transient  response"  experiments  for  parameter 
Identification.  One  of  the  problems  frequently  encountered  In  param- 
eter estimation  Is  that  there  are  too  many  unknown  parameters  to  effec- 


tively estimate  all  of  them  In  any  one  experiment  (see,  for  example. 


-(1) 


(Ref  85)).  However,  by  constructing  the  subspaces  1 > 1,  2,  ...p. 


j ■ 1,  2,  ...m,  the  possibility  exists  to  select  an  Initial  condition 
vector  which  will  zero  the  "Influence"  of  some  parameters  while  maximizing 
the  "Influence"  of  the  remaining  parameters.  Such  a procedure  might  be  a 
possible  way  to  reduce  the  number  of  "exciting"  parameters  In  a parameter 
Identification  experiment.  An  example  of  this  technique  Is  given  at  the 


end  of  this  section.  Also,  this  example  shows  that  the  zeroing  subspaces 


r(i) 


Zj  ' provide  fundamental  Information  concerning  Insensitivity  and 


Id entlf lability  for  both  the  zero-input  and  zero-state  responses.  This 
Is  an  Illustration  of  just  one  of  the  many  ways  In  which  the  properties 
of  Insensitivity  and  identification  can  be  related.  The  relationship 
between  Insensitivity  and  Identlflablllty  Is  examined  further  In  Section 
IV. 4. 


Example 

Suppose  that  C Is  the  2x2  Identity  matrix  and  that 


A - 


^ 0 


^>2  ^3 


(202) 


Then 


'>1  0 

.3 

bj  0 

A ■ 

bjbj  + bjbjbj  + bjb^  b’ 

(203) 


r(l) 


and  subspaces  Z^  , j ••  1,  2,  1 <■  1,  2,  3,  may  be  determined  from 
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V«  nay  Interpret  these  zeroing  subspaees  In  the  following  way:  the 


output  y^  Is  Insensitive  to  cne  parameter  component  wnenever  the  state 
z e R la  In  the  subspace  If  the  Initial  condition  d e and 


if  there  Is  no  control  Input,  then  the  output  y.  will  never  be  Influenced 


Notice  that  we  have  determined  these  sub' 


spaces  on  a "structural”  basl^;  that  Is,  the  subspaces  have  been  deter' 


mined  as  functions  of  the  parameter  values,  b.,  and  the  subspaces  change 


for  particular  values  of  the  parameters  (for  example,  when  b, 


As  we  commented,  these  subspaces  may  be  used  to  help  design  exnerl' 


ments  which  will  systematically  reduce  the  number  of  Influencing  components 
on  each  experiment.  For  example,  for  this  simple  second  order  system,  the 


output  y.  Is  never  Influenced  by  b,  or  b_  (since  Z 


the  output  y.  can  be  used  to  Isolate  b. 


{e,}  then  the  output  y.  Is  only  Influenced  by  b,,  and 


so  we  may  uniquely  estimate  b-  from  the  output  y_.  Then  once  estimating 


b.  and  b,,  we  may  then  select  d e {e. } and  use  the  output  y.  to  estimate 


For  this  simple  example,  these  conclusions  could  easily  be  made  by 


application  to  more  complex  systems.  Finally,  we  comment  that  a more 


coiaplex  system  may  prevent  a "scructural"  examination  of  the  various  sub' 


spaces;  however,  for  particular  values  of  the  parameters  the  matrices  and 


subspaces  are  easily  determined  on  the  digital  computer 


3.  SENSITIVITY  CONTROLLABILITY 


In  this  section  we  discuss  sensitivity  controllability  for  the 
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•ystam  S...  The  approach  la  baaed  upon  the  deflnltlona  and  resulca  of 
LC 

Section  II. 4 for  the  Hilbert  apace  aysten  Since  the  "aenaltlvlty  aya- 

tea"  for  S..  (aee  equation  (6)-(7))laa  linear  time- Invariant 
ordinary  differential  equation  ayatem,  controllability  properties  of  this 
system  may  be  determined  from  the  n(p  -f  1)  x n(p  -f  l)r  dimensioned 
"controllability  matrix"  (Ref  33) 

[B  I AB  1 I . . . I B].  (210) 


this  Is  the  approach  taken  by  Gupta  and  Hall  (Ref  108)  and  Gupta 


Indeed 


and  Mehra  (Ref  84)  In  order  to  reduce  the  required  number  of  differential 


equations  for  calculating  the  sensitivities  vhen  there  are  zero  Initial 


conditions.  They  examine  linear  independence  of  the  column  vectors  of 


the  matrix  (210)  and  conclude  that  there  are  never  more  than  nr  Independent 


columns 


obtain  a matrix  test  for  sensitivity  controllability  which  Is  equivalent 


to  the  matrix  (210) . For  consistency,  we  consider  output  sensitivity 


controllability  and  use  the  operator  definitions  presented  In  Section  II. 4 


(Definitions  4 and  5).  As  in  the  previous  section,  the  main  Intention 


here  is  to  demonstrate  how  the  description  of  Section  III  can  be  conven 


iently  utilized  to  determine  controllability  properties  of  the  sensitiv- 


ity operators.  The  central  result  Is  given  In  the  following-  theorem 


Theorem  7 


The  system  S. _ Is  output  sensitivity  controllable  If  and  only  If  the 


rank  of  the  m(p  -f  1)  x 2nr  dimensioned  matrix  G,  equation  (200)  Is 
n(p  + 1).  The  output  sensitivity  controllable  subspace  of 


the  range  space  of  G whereas  the  uncontrollable  subspace  Is  the  null 
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Proof 


Pron  equation  (199)  we  have 


Y (t,u)  - Gg(t:u) 

S e ■ • 


(211) 


Then  since  the  2n  scalar  functions  0^(0,  which  comprise  the  vector  func- 
tion a(*)  are  linearly  Independent  on  every  Interval  of  finite  length,  it 


is  easily  shovm  that  the  operator 

g(t;*)  - a(t  - s)(Ods 
0 


(212) 


maps  L2(0,t;R^)  onto  (Luenberger  (Ref  20;  56))  Thus  the  range 

space  (controllable  subspace  - see  Definition  4)  of  the  operator 


W(t)(*)  - Gg(t;*) 


(213) 


is  the  range  space  of  the  matrix  G.  By  Definition  4 and  5 the  remaining 


conclusions  of  the  theorem  follow  immediately. 


Q.E.D. 


Controllability  properties  of  the  "sensitivity  system"  for  the  system 
are  quite  Important  for  several  reasons.  As  mentioned,  Gupta  and  Hall 
(Ref  108)  and  Gupta  and  Mehra  (Ref  84)  use  controllability  properties  of 
the  matrix  pair  (A,B)  to  reduce  the  number  of  differential  equations  for 
computing  the  sensitivities  for  linear  time-invariant  systems  with  zero 
initial  conditions.  Their  method  may  be  extended  to  system^  with  non-zero 
initial  conditions  by  the  addition  of  suitable  "pseudo"  control  inputs. 
This  method  of  reduction  results  in  the  mlnlminn  number  of  required  differ- 
ential equations  and  eliminates  some  of  the  computational  problems  which 
can  be  associated  with  the  use  of  special  canonical  form  reduction  tech- 
niques. (Ref  45)  (Ref  47) 
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Xn  a almllar  nannar,  we  nay  use  the  sensitivity  controllability 
properties  to  reduce  computational  requirements  associated  with  the  oper- 
ator description  of  the  sensitivities.  To  be  aore  explicit,  suppose  that 
the  rank  of  the  m(p  + l)x2nr  matrix  G Is  y.  Then  y 1*  the  dimension  of 
the  output  sensitivity  controllable  subspace,  and  there  are  but  y linearly 
Independent  functions  from  the  m(p  -4-  1)  functions 

{y  (Oi  1 ■ 1«  2,  ...p}.  Therefore,  the  remaining  n(p  -f  1)  - y 

Se8e  2e8e 

functions  may  be  determined  as  linear  combinations  of  the  y Inchependent 
functions.  Indeed,  there  exists  a non-unique  transformation  matrix  R such 
that  the  matrix  product  KC  has  the  partitioned  form 


KG 


m(p+l)x2nr 


(214) 


where  G has  dimension  yx  nr  and  rank  y.  (Ref  21:  54)  Then  the  first  y 

c 

elements  of  the  transformed  vector  function 


Y,  . (su)  - KY,  ^ (Su)  (215) 

2e8e  ZeSe 

will  be  linearly  independent  combinations  of  the  2nr  functions  of 
t e [0,t^] 


aj(t  - s)uj^(s)ds  (216) 


J " 1,  2,  ... 2n 
k * 1,  2,  ...r. 


The  remaining  m(p  + 1)  - y elements  of  Y (•;“)  then  Identically 

Z e 8 * 

zero.  This  fact  shows  the  relationship  between  sensitivity  controllability 
properties  and  zero-state  insensitivity;  the  m(p  + 1)  - y elements  of 
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T (*;u)  ar*  zaro-acatc  tnscnalClvc  Idaatlcallx  zaro)  for  all 

poaalbla  control  Inputa. 

Another  Inportant  aapact  of  tha  controllability  propartlca  of  the 
aanaltlvlty  system  is  associated  with  making  T(t^;u)  Identically  zero  at 
the  terminal  time.  (Kalman,  et  al  (Kef  17))  This  aspect  of  controllability 
of  the  sensitivity  system  has  been  examined  by  a nueber  of  researchers 
(e.g.,  Guardabassl,  et  al,  (Ref  52)),  and.  In  particular,  sufficiency  con- 
ditions for  uncontrollablllty  of  the  sensitivity  system  have  been  an  area 
of  consld.^rable  Investigation,  (e.g.,  (Refs  48  - 53)).  Cuarc'.«bassl,  et  al, 
(Ref  52)  present  the  most  comprehensive  results  on  uncontrollablllty  of 
the  sensitivity  system,  and  a general  statesient  of  their  results  is  that 
the  sensitivity  system  is  always  uncontrollable  If  the  parameter  dimension 
p exceeds  the  control  dimension  r. 

Ve  nay  use  the  controllability  properties  of  the  sensitivity  system 
along  with  our  matrix-operator  description  to  give  the  following  result 
concemiu)}  necessary  and  sufficient  conditions  In  making  the  sensitivity 
vector,  7(t^;u),  identically  zero  at  the  terminal  time  t^: 


Corollary  7.1 


For  the  system  there  exists  a control  u*eL2(0,t^;R*^)  suci.  -hrt 
T(t.:u’')  " 0 If  and  only  If  the  rank  of  [E  G]  Is  equal  to  the  rank  of  G. 


Proof 

For  sufficiency,  suppose  chat  the  rank  of  [¥  and  G are  both  y^. 
Then  there  exists  a non-singular  transformation  matrix,  K, , such  that 


(217) 


where  the  rank  of  (Ej^  Gj^]  Is  Yj^,  (Nerlng  (Ref  21:  54)).  Define  the 
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2nrx2nr  positive  definite,  partitioned  matrix  in  which  each  partition 
la  dimension  r x r,  and  the  1.  partitioned  element  la  given  by 
t 


Mjd.j)  - /q  «j(t,  - - t)dt  I 


rxr 


(218) 


Then  alnce  the  rank  G • rank  G,  ••  y, » the  y,  x y,  matrix  (G,M*gT)  Is 

11  11  1 c 1 

Invertible.  Next  define  the  r x 2nr  dimensioned  matrix  fuuctlon 


**(t)  H 


aj(t)  0 


a^(t) 


. . 0 


a2(t)  ...  0 


a^Ct)  0 ...  a2(t)  ...  0 


Then  It  is  straight-forward  to  show  that  the  control^ 


. .a 


2n 


(219) 


«*(t)  - 


.-1 


(220) 


^ere 


^z.l.(^f>  ■ V(^f>  (221) 

will  satisfy 

T(V“’>  ■ 

— 1 ^ 

- V(^z.i.(*^f^  Vo  " t)*u*(t)dt) 

- 0 (222) 

^Indeed,  by  the  results  of  Kalman,  et  al  (Ref  17)  It  may  be  shown  that  u* 
has  norm  strictly  less  than  any  other  control  for  which  the  relation  (222) 
la  true. 


(t) 
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Sow  for  necessity,  suppose  that  the  rank  of  IE  6]  is  and  the  rank 
of  G la  Y2  < Yj.  Furthermore,  let  Gj^,  and  E^  be  the  same  matrices  as 
defined  above.  Then  since  rank  Gj^  ■ rank  G - Y2  ^ Yjt  there  exists  a 
tranaformatlon  matrix  K2  such  that 


- K2 


0 

0 


(223) 


where  the  rank  of  Gj  Is  Y2*  Then  since  the  rank  of  [E  G]  Is  Yj  ^ Y2*  we 
are  assured  that 


K2K1  IE  G] 


I _ 

I G, 


I 0 
0 0 


(224) 


where  the  rank  of  E2  Is  Y^*  Hence  there  Is  no  control  which  will  bring 


the  system  to  zero  conditions  In  a finite  Interval  of  time. 

Q.E.D. 

Lastly,  assume  for  the  moment  that  the  C matrix  Is  the  Identity  matrix 
and  Independent  of  the  parameter  vector  b so  that 


Y(t;u)  ■ x(t:u). 


(225) 


Then  we  may  use  the  non-unique  transformation  matrix,  K^,  described  in  the 
proof  of  Corollary  7.1  to  transform  the  "sensitivity  system"  Into  a set  of 


Y^  non-zero  differential  equations.  In  particular,  we  define  the  new 


’state" 
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X (t)  - Kj^X(t) 


(226) 


where  Xj^(t,  Is  dimension  v, . Then  from  equation  (89) 

X (t)  - K^AK^^  (t)  + K^Bu(t)  (227) 

x"(0)  - t e [0,  t^J  (228) 

But  from  equation  (226)  we  are  assured  that  only  the  first  elements  of 
the  differential  equation  (227)  will  be  non-zero.  Therefore,  we  may 
generate  X(t)  with  only  y^  linear  differential  equations.  Furthermore, 
if  y^^  ■ rank  [E  G]  and  if  the  separate  control  inputs  are  linearly 
independent  functions,  then  this  is  the  absolute  minimum  number  of 
differential  equations  which  will  generate  the  complete  sensitivity  sys- 
tem. Notice  that  this  result  is  an  extension  of  the  conclusions  of  Gupta, 
et  al,  (Ref  84)  (Ref  108)  from  the  case  of  zero  initial  conditions  to  the 
case  in  which  the  initial  conditions  are  arbitrary  and  may,  in  fact,  be  a 
function  of  the  parameter  vector  b.  Also,  we  emphasize  the  fact  that  the 
transformation  matrix  always  exists  regardless  of  conditions  on  the  A 
matrix,  and  it  is  non-unique.  One  convenient  way  to  determine  a transforma* 
tlon  matrix  is  to  determine  the  first  set  of  linearly  Independent  rows 
of  [E  G],  and  then  determine  the  matrix  K,  which  leaves  these  Independent 
rows  unaltered  and  zeros  the  remaining  dependent  rows. 
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Example 

To  Illustrate  the  results  on  sensitivity  controllability,  consider 
the  second  order  system: 


Xj^(t;u) 

X2(t;u) 

IB 

0 1 

h *>2 

- - 

L. 

-J 

yj^(t;u) 

y2(t;u) 

- 

1 0 

0 1 

Xj^(t;u) 


X2(t;u) 


0 


u(t) 


Xj^(t;u) 


X2(t;u) 


The  matrix  G Is 

(0) 

1“ 

G - 


.(1) 


.(2) 


.(3) 


0 

b. 


b,b3  + b‘b3 


‘>2'>3 


b,b3  + b2b3 


2b,b,b3  + b^b. 


0 

0 


0 

0 


0 

b. 


2b2b3 


0 

0 


0 

b. 


2b2b3 


2b2b3 


2b^b3  + 3b‘b3 


0 

1 


1 

b„ 


bi  + b‘ 


.(3) 


(22! 


(231 


(233 


Notice  that  If  b3  Is  zero,  then  only  ' Is  controllable.  Otherwise, 
there  will  be  four  linearly  Independent  rows  of  the  natrlx  G.  and  so 
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four  elements  of  the  zero-state  sensitivity  system  can  be  computed  as 
linear  combinations  of  the  four  "controllable"  ones.  For  Instance, 
suppose  that  bj^  ■ -2,  b^  ■ -3,  and  b^  ■ 1.  Then 
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notice  t^at  K Is  selected  here  to  leave  the  first  four  rovs  of  G 

unaltered  while  zeroing  the  last  four.  Therefore,  If  the  system  has 

zero  Initial  conditions  then  the  entire  set  of  sensitivities  can  he 

generated  from  the  four  differential  equations  for  z(t)  and 

Next,  suppose  that  the  example  system  has  Initial  conditions 
T 

d * [1  0].  Using  the  nominal  parameter  values  hj^  ■ -2,  b2  * **3,  and 
bj  ■ 1 It  Is  easily  shown  that 

[1  0 -2  6 0 1 -3  7 1 

0 -2  6 -14  1 -3  7 -15 


0 0 


-3  0 0 


[E  G]  - 0 1 -3  5 0 0 


0 0 


-2  0 0 


(236) 


0 0 -2  12  0 1 -6  23 


0 0 


0 1-37 


0 0 


1-37  -15 


which  has  the  rank  of  six.  Then  the  transformation  matrix 


-2  0 -3  -III 


0-10 


-2  2 


(237) 


leaves  all  of  the  rows  of  [E  G]  the  same  except  zeroing  cne  last  two  rows. 
Thus  the  sensitivities  with  respect  to  the  parameter  b^jmay  be  determined 
as  linear  combinations  of  x(t;u),  C^^^(c«u),  and  (^^^(t;u),  and  six 
differential  equations  are  now  required  to  compute  the  complete  sensitiv- 
ity system.  Finally,  by  Corollary  7.1  there  exists  no  control  which  will 
completely  zero  X(t;u)  In  a finite  Interval  of  time. 
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4.  PARAMETER  IDENTIFIABILITT 

Th«  final  topic  considered  in  this  section  is  identlflabillty  of  the 
mknown  parameters  in  the  system  One  could  specialize  Lemma  4 and  test 

the  singularity  of  the  p x p local  ident if lability  matrix: 

M-  A [v^^^(t;u)/v^^(t;u)]dt  (238) 

0 

Identif lability  would  then  be  determined  according  to  whether  or  not  M 
has  rank  p.  Such  an  approach  is  appropriate  for  the  time-varylfig  linear 
system,  S^,  and  is  the  method  commonly  known  in  the  literature  (e.g., 

(Ref  102)).  However,  for  the  time-varying  system,  the  definition 

of  identif lability  Itself  (namely,  V mapping  R^  into  the  output  space 
y ■L2(0»bJR™)  Is  one-to-one  — see  Definition  5)  may  be  used  to  obtain 
direct  algebraic  conditions  for  Identif lability  (both  local  and  struc- 
tural) . 

As  discussed  in  Section  II. 7,  the  conditions  for  zero-input  and  zero- 
state  identlflablllty  are  studied  separately.  This  separate  analysis 
allows  one  to  determine  whether  a parameter  is  identifiable  from  either 
the  transient  response  alone  or  the  forced  response  alone  or  from  both. 

Lemma  5 then  concludes  that  the  combined  identifiable  subspace  is  just 
the  sum  of  the  zero-input  and  zero-state  identifiable  subspaces. 

Finally,  we  note  the  approach  taken  here  in  treating  Identlflablllty 
and  the  results  obtained  are  new.  Background  literature  on~ldentlfl- 
ablllty  for  the  system  was  discussed  in  Section  I.l.d.  Although 
there  have  been  previous  structural  and  algebraic  conditions  for  Identi- 
fiabllity  these  previous  approaches  generally  rely  upon  special  canonical 
form  representations  of  the  system  S^^.  As  done  throughout  this  research, 
we  do  not  make  any  assumptions  (except  differentiability)  concerning  the 
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■ystem  matrices,  and  so  the  results  may  be  applied  either  with  or  without 


such  canonical  forms 


a.  Zero~Input  Identlflablllty 


To  discuss  zero-input  Identlflablllty  It  Is  convenient  to  rearrange 


equation  (191)  Into  the  form 


where  we  define 
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Using  this  notation  and  the  definitions  of  IdentlflaMllty  (Deflnl- 

! 

tlons  5,  6,  and  7)  we  may  directly  obtain  algebraic  conditions  for  zero- 

( 

Input  Identlf lability.  We  define  the  following  zero-input  Identifiable  ' 

subspaces: 

^ s zero- Input  Identifiable  subspace  with  | 

*J  respect  to  the  output  component  y . 

Z.l.j 

i M . = zero-input  Identifiable  subspace 

I Z s Xe 

I 

i ' 

I Using  this  notation  we  obtain  the  following  central  theorem: 


I I 


f I 


t I 


X 


Theorem  9 


For  the  system  it  is  true  that: 


DM  - R(Ef ) 

z.i.j  J 


(243) 


11)  U . - R(E*") 

ZeXe 


(244) 


111)  The  system  Is  zero-input  identifiable  If  and  only  If  the 


rank  of  E*  Is  p. 


Proof 


By  Definition  5,  the  zero-input  non-identlflable  subspace  relative 


to  the  output  component  y . (*)  Is  the  null  space  of 

Z.l.j 


(•)  - a’^(OE* 

..i.j  3 


(245) 


But  since  the  scalar  functions  aj(*)  are  linearly  Independent  (and  hence 


non-zero  almost  everywhere),  the  null  space  of  V . (•)  is  just  the  null 

* i 

space  of  E. . The  Identifiable  subspace  Is  the  orthogonal  complement  of 
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the  non-* Identifiable  subspace,  which,  by  Lemma  1 Is  the  range  space  of 
*T 

E.  . Then  S,  - Is  zero-input  Identifiable  relative  to  y . If  and  only 

J iaC  Ze  Xe  j 

* ^ 

If  the  rank  of  Is  p. 

From  equation  (140) , the  conclusions  of  part  11)  and  111)  follow  In 


a manner  similar  to  that  above. 


Q.E.D. 


Theorem  9 provides  the  central  algebraic  results  on  zero-input 
Identlflablllty.  The  results  are  only  dependent  on  the  structure  of  the 
system  matrices  and  Initial  condition  vector;  they  are  completely  general; 
and  no  Integrals  are  required  to  obtain  the  zero-input  Identlflablllty 
matrix.  Notice  the  connection  between  this  theorem  on  local  Identlfl- 
ablllty  and  zero-input  Insensitivity;  the  matrix  E*  used  here  has  the  same 
elements  as  the  E matrix  which  was  used  In  Theorem  5 In  connection  with 
zero- Input  Insensitivity.  If  the  system  Is  totally  zero-input  Insensitive 
(i.e.,  E*  * 0),  then  the  non-ldenclflable  subspace  Is  all  of  and  there 
Is  no  Information  with  which  to  Identify  any  of  the  parameters  from  the 
transient  response.  Such  a connection  between  Insensitivity  and  non- 
Identlf lability  was  noted  by  Bonlvento  (Ref  55),  but  no  specific  results 
on  Identlflablllty  were  obtained. 

As  would  be  expected,  this  theorem  on  zero-input  Identlflablllty 
reduces  to  previously  known  results  concerning  system  observability 
(e.g..  Ref  33))  for  the  special  case  In  which  d(b)  >•  b Is  the  only  unknown 
system  parameter  vector: 


Corollary  4.4.1 

If  d(b)  • b Is  the  only  unknown  system  parameter  vector,  then 
Is  ''observable"  If  and  only  If  the  rank  of  the  n x mn  matrix 


,_T  I ! I >„.n-l.T, 

[C  I (CA)  1 ...  I (CA  ) ] 


(246) 


Is  n. 
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Proof 

Since  d(b)  ••  b Is  the  only  unknown  parameter.  It  Is  true  that 


- t(Cjej^).  . .(CjA^“  “ \)] 
where  e^^  Is  the  1^*^  Euclidean  basis  vector.  Then 


EJ. 


CjA 


Cj* 


2n-l 


2nxn 


(247) 


(248) 


\ 


Using  the  Cayley-Hamllton  theorem  (Ref  33)  It  is  easily  shown  tnat  the 
rank  of 


m 


(249) 


2nmxn 


Is  equal  to  the  rank  of  the  matrix  (246) . 


Q . E/U . 


Finally,  we  note  that  if  the  system  has  eigenvalues  with  strictly 

aegatlve  real  parts  (S^^  Is  asymptotically  stable),  or  ff  It  has  a single 

zero  eigenvalue  with  zero  sensitivities  with  respect  to  that  eigenvalue, 

then  all  of  the  sensitivity  operators  will  be  asymptotlcrlly  stable  In  the 

first  case  or  marginally  stable  In  the  second.  (See  (Ref  52)  or  Corollary 

2.4)  Then  although  the  scalar  functions  aj(*),  J ■>  1,  2,  ...2n,  are 
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linearly  independent  and  hence  non-zero  almost  everwhere,  after  a few 
system  time  constants  they  will  be  so  near  to  zero  that  there  is  very 
little  "signal  strength"  remaining  In  the  sensitivities.  Therefore,  for 
all  practical  purposes,  the  zero-input  response  will  no  longer  be 
Identifiable  In  the  steady-state. 


b.  Zero-State  Identlf lability 

The  question  of  zero-state  Identlf lability  Is  somewhat  more  complex 
than  zero-input  Identlf lability  because  the  Input  function, 
u(’)  z L2(0,t^;R^),  can  also  affect  Identification  capability.  Certainly 
If  u(»)  ■ 0,  then  y (su)  “ 0 and  there  Is  no  Information  whatsoever 

Z • 8 • 

from  the  zero-state  response.  The  question  of  Input  design  for  the  pur- 
pose of  optimal  parameter  Identification  was  discussed  In  Section  II. 8 
and  comnutatlonal  algorithms  to  calculate  the  optimal  Input  are  con- 
sidered In  Section  VII.  Here  we  derive  necessary  conditions  for  zero- 
state  Identlf lability:*  sufficiency  conditions  are  dependent  upon  the 
control  Input  which  one  selects. 

Like  the  previous  section.  It  Is  helpful  to  rearrange  equation  (192) 
Into  the  form 


V (t;u)  - [vf^^  (t;u).  . .vf^^  (t;u)] 

Zaoe . Z*8a«  ZeSa* 


'i 


•j 


Ixp 


- g’^(t;u)Gj 


(250) 


V,  _ (t;u) 

Z • 8 « 


h... 

m 


g^(t;u)G* 


g^(t;u)G* 

n 


- g^(t;u)*G* 
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where  we  define 


OJ. 


,,(!)’  (2)'' 

tOj  Oj  . 


2nrxp 


and 


(252) 


(253) 


Then  defining  the  following  zero-state  Identifiable  subspaces: 

^ (u)  = zero-state  Identifiable  subspace  relative 

* 'J  to  the  output  component  y (•;«); 

z . s . j 

M (u)  = zero-sta*-e  Identifiable  subspace; 

we  may  use  the  above  notation  to  obtain  the  central  result  on  zero-state 
identlf lability: 


Theorem  10 


For  the  system  It  Is  true  that 


1)  M,  . (u)  C RCcf ) 

Z«8«j  J 


(254) 


It)  _ (u)  C R(G*^) 

Z • 8 • 


(255) 


111)  Is  zero-state  Identifiable  only  If  the  rank  of  G*  Is  p. 

Proof 

For  part  1),  the  non-ldentlf iable  subspace  with  respect  to  the  out- 
put component  y (•;«)  Is  the  null  space  of  V C*;'!).  From  equa- 
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Hence 


where  the  last  step  follows  from  Lemma  1 


Fart  11)  Is  obtained  In  a similar  manner.  Then  the  system  S. _ Is 


zero-state  Identifiable  If  and  only  If  M (u)  ■ R*^  which  can  be  true 


The  conditions  of  Theorem  lO  can  be  made  sufficient  as  well  as 


necessary  (l.e.,  the  sjubspace  relations  are  replaced  by  equalities) 
provided  that  the  2nr  scalar  functions  of  t e [0,t.] 


are  linearly  Independent.  This  condition  may  be  tested  by  considering 
whether  or  not  the  2nr  x 2nr  Gram  matrix  (see  equations  (185)  - (186)) 


Is  non-singular.  Since  the  scalar  functions  a.(*)>  j ~ 1>  2 


are  linearly  Independent  on  every  interval  of  finite  length,  the  condl 


tlon  of  linear  Independence  of  the  functions  (258)  Is  not  too  severe  from 


a pure  mathematical  point  of  view.  However,  from  a practical  point  of 


view.  If  the  system  has  stable  elgen''alues  then  the  scalar  functions  a.  (*) 


all  tend  to  zero  In  the  steady  state.  This  Is  easily  seen  by  recalling 
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that  the  functions  a^Ct)  are  linearly  Independent  combinations  of  the 
functions  (see  Appendix  A).  Therefore,  In  the  steady  state  the 

potential  for  "structural"  zero-state  Identlf lability  can  only  be 
realized  If  the  control  Input  Is  selected  to  "excite"  the  modes  of  the 
system  to  a sufficient  degree  to  maintain  linear  Independence  of  the 

a •* 

functions  (258).  This  Is  the  problem  referred  to  by  Astrom  and  Eykhoff 
(Ref  80)  as  the  requirement  for  "persistent  excitation".  For  the  class 
of  sinusoidal  Inputs  this  problem  has  been  studied  by  Hehra  (Ref  98), 

Kim  and  Llndorff  (Ref  95),  Hoberock  and  Stewart  (Ref  94),  and  others 
to  determine  sufficient  ntmbers  of  Input  frequencies  to  ensure  steady- 
state  Identlf lability. 

Finally,  we  comment  that  the  results  of  Theorem  10  may  be  applied 
to  one  column  vector  of  B at  a time,  and  thereby  we  may  obtain  the 
zero-state  Identifiable  subspace  relative  to  each  control  Input  component, 
u^(*),  k - 1,  2,  ...r.  Then  by  selecting  all  of  the  controls  to  be 
Identically  zero  except  u^,  the  parameter  components  which  lie  In  the  non- 
Identlflable  subspace  of  this  Input  will  not,  to  first  order.  Influence 
the  observed  output.  Again  this  may  be  a possible  systematic  way  tc 
reduce  the  number  of  Influencing  parameters  In  a series  of  parameter 
Identification  experiments.  Indeed,  the  non-zero  Input  component,  u^, 
may  even  be  selected  to  optimize  the  estimation  capability  for  those 
parameters  which  lie  In  Its  Identifiable  subspace.  (See  Section  VII) 

c.  Summary  of  Identlf lability  Conditions  for  S^^ 

In  the  previous  two  subsections  the  zero-input  and  zero-state  local 
Identifiable  subspaces,  M . and  M , have  been  determined.  By  using 

ZeXe  Ze8e 

the  operator  definitions  of  local  identlflablllty  (Definition  5)  and  the 
algebraic  representation  of  the  sensitivity  operators  for  the  system  S^, 
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(Theorem  4),  both  of  these  subspaces  may  be  obtained  thrpugh  purely 


algebraic  means  which  Involve  neither  the  determination  of  the  sensitiv- 


ity operators  themselves  nor  Integration  for  determining  the  local 


Identlf lability  matrices 


(t)]dt 


These  results  and  the  approach  taken  are  new  and  should  offer  useful 


Finally,  Lemma  8 says  that  the  total  Identifiable  subspace  Is 


while  the  total  non-ldentlflable  subspace  Is 


Again  consider  the  second  order  syscem  introduced  In  the  example 


of  Section  IV. 3,  equations  (229)  (230).  For  simplicity  we  will  examine 


only  zero-state  Identlflablllty  (we  will  assume  the  system  (229)  (230) 


has  zero  Initial  conditions).  Using  equation  (231)  we  obtain 


AFAL-TR-76-118 


gJ-  Ig[ 


(1) 


and 


0 

0 


2b2b3 


0 

1 


b,  + b2 


[G, 


(1) 


(2)’’  (3)’’ 

^^2  °2  ^ 


2b2b3 


0 

b. 


2b2b3 


2bj^b3  + 3b2b3 


1 

^2 

2b3^b2  + b2 


(264) 


(265) 


Hence  If  63  Is  non-zero,  then  the  system  (229)  - (230)  is  zero-state 
identifiable  from  either  the  output  or  the  output  y2  (since  the  rank 
of  gJ  and  G*  are  both  3).  However,  if  b3  is  zero,  then  only  b3  is  zero- 
state  identifiable  from  either  output. 


5.  SUMMARY 

In  this  section  the  geometric  properties  of  Theorem  4 have  provided 
algebraic  conditions  for  insensitivity,  sensitivity  controllability,  and 
Identlf lability  for  the  time- invariant  system  Some  of  these  condi- 

tions are  restatements  or  extensions  of  previously  known  results  (e.g., 
conditions  lor  zero-input  and  zero-state  Insensitivity  (Theorems  5 and  6) , 
general  algebraic  conditions  for  sensitivity  controllability,  (Theorem  7), 
and  definitive  algebraic  conditions  for  zero  terminal  sensitivity 
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(Corollary  7.1)).  However,  In  other  cases  the  conclusions  and  the 
approach  taken  are  new  (general  algebraic  conditions  for  local  and 
structural  Identlflablllty  (Theorems  9 and  10),  and  a systematic  way 
to  reduce  the  number  of  Influencing  components  In  a series  of  Identlfl-' 
cation  experiments  (Corollary  5.1)).  In  all  cases  the  operator  approach 
of  Section  II  provides  a straight-forward  means  to  derive  the  results, 
and  no  transformation  of  the  system  to  a suitable  canonical  form  Is  re- 
quired for  the  analysis.  A simple  second-order  system  was  used  to 
Illustrate  the  various  algebraic  conditions  and  to  demonstrate  the  way 
In  which  these  algebraic  conditions  can  be  used  to  provide  structural  as 
well  as  local  Information. 

In  the  next  three  sections  we  concentrate  on  the  computational  as- 
pects of  Theorem  4 by  considering  parameter  Identification  algorithms, 
control  design  for  minimum  sensitivity,  and  sensitivity  operator  design 
for  Improved  parameter  Identification.  Once  again  the  operator  approach 
of  Section  II  Is  used  as  the  basis  for  this  discussion. 
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Section  V 

QOASILINEARIZATION  PARAMETER  IDENTIFICATION  FOR 
LINEAR  TIME-INVARIANT  SYSTEMS.  Sj^ 

The  previous  section  Investigated  geometrical  properties  of  the 
natrlx-operator  representation  for  parameter  sensitivities  In  the  linear 
time-invariant  system.  S.-,  (see  Theorem  4).  In  this  section  the 
computational  aspects  of  this  representation  are  considered  by  using  It 
as  the  basis  of  a quaslllnearlzatlon  algorithm  for  the  Identification 
of  unknown  parameters  In  the  system  S^^.  Quaslllnearlzatlon  Is  a well- 
known  method  for  the  estimation  of  parameters  In  such  systems  (e.g., 

Kumar  and  Srldhar  (Ref  86)).  and  so  the  unique  element  of  this  section 
Is  to  Illustrate  how  the  results  of  Theorem  4 and  Appendix  A may  be 
used  as  an  alternative  to  differential  equation  computation  of  the  param- 
eter sensitivities. 

The  quaslllnearlzatlon  algorithm  utilized  Is  based  upon  the  dis- 
cussion of  Section  II. 6.  We  noted  In  that  section  that  the  computational 
requirements  of  quaslllnearlzatlon  are  fairly  representative  of  gradient 
and  Newton-Raphson  algorithms  as  well.  However,  the  Newton-Raphson 
aiethod  requires  one  to  compute  the  second  order  as  well  as  the  first  order 
parameter  sensitivities.  Appendix  B presents  a matrix-operator  form  of 
the  second  order  sensitivities  which  Is  equivalent  to  the  one  developed 
In  Section  III  and  used  here  for  the  first  order  parameter  sensitivities. 

1.  QUASILINEARIZATION  ALGORITHM 

The  measured  system  output  Is  denoted.  y(t).  t e [O.t^].  and  the 
model  system  output  at  the  Initial  guess  of  the  parameter  vector. 


Ill 
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b e sF,  is  designated  y(t;b  ),  t c [0,t-].  The  output  error  cost  func- 
o ox 

tlonal  which  we  wish  to  minimize  Is  (see  Section  II.6*d): 


JpCb)  - [(y(t)  - y(t;b))/Q(t)(y(t)  - y(t;b))]dt 


(266) 


where  the  Q(t)  e is  assumed  to  be  symmetric  and  uniformly  positive 

definite  for  all  t e [0,t^].  From  Section  II. 6. b and  II. 6. d the  quasl- 
llnearlzatlon  algorithm  to  minimize  this  cost  functional  Is  obtained 
from  the  sequence  of  steps: 


Abj  - (V*QV)"^  V*QZj 


(267) 


whe.e  (V*QV)~^  Is  the  p x p local  "Information"  matrix^ 


(268) 


(V*QV)j  - /Q^[v^^^(t;bj)/Q(t)v^^(t;b^)]dt  ; 


(269) 


V*QZj  Is  one-half  the  negative  gradient  of 


V*Qz^  - [v^^^(t;bj)/Q(t)z(t;bj)]dt  ; 


(270) 


and  z(t;bj)  = y(t)  - y(t;bj)  Is  the  output  error  at  the  current  estimate 

^ T> 

b^  c R . Recall  from  Section  II. 7 that  the  local  Information  matrix, 
(V*QV)j,  possesses  a bounded  Inverse  If  the  system  Is  locally  Identifiable 
at  b^  c R*^.  l/e  will  assume  Chat  Che  system  Is  locally  Identifiable  along 


^Note  that  If  Che  output  Is  only  sampled  at  discrete  points,  Chen  the 
integrals  In  equations  (269)  and  (270)  would  be  replaced  by  summations 
over  these  discrete  time  points. 
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the  sequence  of  estimates > and  hence  that  these  Inverses  do  Indeed 
exist. 

Notice  that  If  a direct  “sensitivity  system”  differential  equation 

A 

approach  were  used  In  determining  the  output,  y(t;b.),  and  sensitivities 
(1) 

V (t;bj),  1 * 1,  2,  ...p,  then  each  Iteration  would  require  the  solu- 
tion of  possibly  n(p  -<■  1)  coupled  linear  differential  equations^  In  addi- 
2 

tlon  to  the  (p  + 3p)/2  quadrature  Integrals  required  for  computing 
(V*QV)j  and  • This  number  of  differential  equations  and  Integrals 

may,  of  course,  be  reduced  If  the  controllability  properties  of  the 
sensitivity  system  ere  taken  Into  account  (see  Section  IV. 3),  and  Gupta 
and  Mehra  (Ref  84)  present  the  details  of  this  as  applied  to  parameter 
Identification  algorithms.  However,  as  a computational  alternative,  we 
apply  the  results  of  Section  III  and  Appendix  A,  thereby  replacing  the 
solution  of  the  differential  equations  with  the  determination  of  the 
eigenvalues  of  A(bj)  and  the  solution  of,  at  most,  2nr  quadrature  Integrals 
for  computing  the  parameter  sensitivities^. 

The  computational  details  of  this  method  are  presented  In  the  sub- 
sequent algorithm.  The  notation  used  Is  based  upon  the  development  of 
Appendix  A In  which  the  matrix-operator  representation  of  the  parameter 

sensitivities  Is  factored  directly  Into  "component"  form  utilizing  the 

1 1 ^k^ 

basis  functions,  t'^e  , for  real  eigenvalue  q.  (or  f^e  cosoi.  t and 

1 *^k^ 

f'e  slnii^t  for  complex  eigenvalues)  rather  than  the  linear  combinations 
of  the  scalar  functions,  aj(t)  (see  Theorem  4).  In  particular,  the  sys- 
tem nominal  output  and  output  sensitivities  are  computed  from  the  relations 

y(t;b,)  - F^°^f(t)  + f(t  - s)*u(s)ds  (271) 

J 0 


^Recall  that  r is  the  control  dimension,  n Is  the  state  dimension,  and  p 
Is  the  parameter  dimension. 
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v^*^(t;bj)  - P^*^f(t)  + /J  f(t  - 8)*u(8)d8  (272) 

1 • 1«  2,  . . .p 

where  £(*)  l8  a 2n-<limen8loned  vector  function  defined  in  Appendix  A and 
which  has  elements  of  the  form  t^e  ^ F^^\  and 

1 ">  1,  2,  ...p,  are  m x 2n  and  m x 2nr  dimensioned  matrices  and  are 
computed  as  described  in  Appendix  A;  and  we  define  the  special  product 


f(t  - 8)*u(8) 


fj^Ct  - 8)u(8) 
f2(t  - s)u(8) 


(273) 


f2jj(t  - 8)u(s) 


2nrxl 


This  form  is  used  for  computation  rather  than  the  one  used  in  Theorem  4 
and  Section  IV  because  the  convolutions  of  equations  (271)  - (272)  may 
be  directly  transformed  into  the  time-varying  linear  combination  of 
quadrature  Integrals  through  relations  such  as 


(t  - 8)e 


■ik(t  - .) 


■ te 


- e 


V 

se 


(274) 


However,  either  form  is  equivalent  and  the  differences  are  merely  a matter 
of  bookkeeping  on  the  digital  computer.  The  form  (271)  - (272)  will  also 
be  utilized  in  the  computational  applications  of  Sections  VI  and  VII. 

The  algorithm  for  quaslllnearlzatlon  parameter  Identification  is  now 
stated: 


Qucslllnearlzatlon  Algorithm  to  Identify 
the  Parameters  in  the  System  S^- 
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1)  Bated  upon  the  current  estimate  of  the  unknown  parameters,  e R^, 

A 

determine  the  distinct  eigenvalues  of  A(bj),  k 1,  2,-  ...p,  and 

A 

tnelr  multiplicity,  n^.  In  the  characteristic  polynomial  of  A(bj).  Note 
that  If  the  eigenvalues  of  have  been  computed,  then  Iterative 

techniques  for  computing  the  eigenvalues  of  A(bj)  might  be  useful. 

(e.g.,  (Ref  11:  455  - 495)).  Indeed,  by  using  the  expression  (157) 

suggested  In  Section  III  for  computing  the  eigenvalues  sensitivities,  a 
first  guess  of  the  new  eigenvalues  might  be  computed  from  the  Taylor's 
series  relation 


(275) 


where  the  superscripts  "j”  and  "j-1"  refer  to  the  and  J-l*"^  Iterations, 
respectively.  (See  the  example  of  Section  V.2  for  Illustration.) 

A 

2)  Using  the  eigenvalues  of  A(bj),  Invert  the  2n  x 2n  Vandermonde  matrix, 
A.  (See  Appendix  A.)  Then  using  the  elements  of  A and  the  Input 
quantities  A,  B,  C,  d,  A^^^,  and  d^^^,  1 “ 1,  2,  ...p,  compute 

the  matrices 


P - P' 


H - H' 


(276) 


m(pfl)x2n 


|m(p+l)x2nr 


(See  Appendix  A for  the  definition  and  computation  of  the  above  quanti- 
ties.) 


115 


AFAL-TR-76-118 


3)  Compute  end  piece  In  temporery  etorege  the  (p  -f  1)  m-dlmenelonel 
vector  fuuctlone  of  t e [0,t.],  y(t;b.)-,  v^^^(t;t.),  1 ■ 1,  2,  ...p. 


(See  equetlone  (271)  - (272).) 


uelng  equetlone  (269)  - (270).  Mote  thet  the  "controlleblllty"  properties 


of  the  sensitivities  may  be  used  to  reduce  the  number  of  Integrals 
(siinnatlons)  required  In  computing  the  matrix  (V*QV)j  and  the  vector 
V*Qz.  (see  Section  IV. 3).  In  particular.  If  v Is  the  dimension  of  the 


then  tnere  are  only  v linearly  Independent  functions  from  the  mp  func 


dons  (v 


Therefore,  there  will  only  be 


(v  + l)v/2  Integrals  (summations)  required  to  compute  the  symmetric 
matrix  (V*QV),  and  v to  compute  the  vector  V*Qz. . 


and  either  return  to  1)  or  stop  when  | |y  ' y(^j)| I Is  less  than  some 
specified  small  value. 
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2.  EXAMPLE 

Once  again  consider  the  second  order  system  Introduced  In  the 
examples  of  Sections  IV. 2 •*  IV. 4: 


Xj^(t) 

X2(t) 

- 

» - 

72  (t) 

- 

h *>2 


1 0 
0 1 


Xj^(t) 

x^Ct) 


Xj^(t) 

X2(t) 


u(t)  t e [0,tj]  (280' 


Xj^(O) 

X2(0) 


1 

0 

L J 


(281) 


3 c*  T 

The  nominal  value  of  b c R Is  again  assumed  to  be  b * [■*2  -3  1]  , 

o 

and  so  the  nominal  A matrix  has  eigenvalues  -1  and  q2  *■  -2, 

To  use  the  algorithm  of  the  previous  subsection,  we  must  first 
determine  the  matrices  1 ■ 1,  2,  3,  as  described 

In  Appendix  A.  To  do  so  requires  that  we  Invert  the  Vandermonde  matrix 


A - 


1 

0 

1 

0 


^1 

1 


2 


2q>, 


3 

11 


2q,  3q: 


^4 


1-11  -1 

0 1-23 
1-2  4 -8 

01  -4  12 


(282) 


This  Inverse  Is  easily  shown  to  be 


117 


AFAL-TR-76-118 


A-1. 


-4  4 5 2 

-12  8 12  5 

-9  5 9 4 

-2121 


(283) 


Then  since  there  Is  a single  control  Input  we  may  determine  F and  H 
from  the  equations 


- GA"^ 


(284) 


where  E and  G are  given  by  expression  (236)  of  Section  IV. 3.  Thus»  for 
these  nominal  parameter  values,  the  matrices  F and  H become 


-1  0 


0-10 


-2  0 


0 2 0 


-3  2 


12  1 


-2  -5  -2 


-2  -4  -2 


-1  -3  -2 
-1  -3  -2 


(285) 


-6  2 


14  4 


0-10 


0 2 (T 


Then  the  output  and  output  sensitivities  are  computed  from  the  expressions 
(271)  and  (272)  where 


f(t)  - (e“‘  te"*  e"^*^  te'^^]'^ 


(286) 


Notice  that  the  Integral  /^^(c  - s)u(s)ds  may  be  obtained  as  time-varying 
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combinations  of  the  four  quadrature  Integrals: 

/^e*u(8)d8;  /*8e®u(s)d8;  /*e^'*u(8)ds;  /^8e^®u(8)d8  (287) 

0 0 0 0 

Hence,  there  are  only  four  Integrals  required  to  compute  the  complete 
set  of  parameter  sensitivities  for  each  Iteration  of  the  quaslllnearlza- 
tlon  algorithm.  It  Is  Interesting  to  note  that  If  a sensitivity  system 
differential  equation  approach  were  utilized  In  computing  the  sensitivities, 
there  would  be  either  eight  dlfferentlar.  equations  for  a direct  solution, 
or  a minimum  of  six  if  reduction  techniques  were  applied  (see  the  example 
of  Section  IV. 3).  Thus,  In  either  case  the  number  of  required  differential 
equations  would  be  greater  than  the  number  of  required  quadratures  by  the 
operator  method. 

Once  the  new  estimate  b^  has  been  computed  from  equation  (279) , to 
perform  another  Iteration  of  the  algorithm  requires  a recomputation  of  the 

A 

eigenvalues  of  A(b^) . A first  guess  of  the  new  eigenvalues  can  be  ob- 
tained by  using  the  eigenvalue  sensitivities  computed  from  a Taylor's 

A 

series  expansion  about  b . For  this  example,  the  new  guess  would  be 

o 

where  the  superscript  "o"  here  denotes  the  eigenvalue  and  parameter  com- 

A 

ponents  at  the  Initial  guess  b^.  The  eigenvalue  sensitivities  may  be 
computed  jn  the  basis  of  equation  (157)  of  Section  III.  In  fact,  since 
the  C matrix  Is  Independent  of  any  of  the  parameter  components  which 
appear  In  the  A matrix,  we  may  use  the  column  vectors  of  F to  determine 
these  eigenvalue  sensitivities  (the  column  vectors  of  H could  alternately 
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be  utilized  since  the  parameters  of  A do  not  appear  In  the  B matrix 
either).  Partitioning  F Into  the  form 


p(0)^)  p(0)(2)  F^°^(3)  F^°\4) 


F^^^(4) 


(290) 


ve  may  compute  these  eigenvalue  sensitivities  from  the  relation^: 


F^^^(2)  - 


F^^^(4)  - (q2)(i)^^°^(3) 


(291) 


(292) 


Hence 


^**1^  (2)  “ 

(293) 

^**2^(1)  “ 

^**2^(2)  “ ^ 

<’2>0)  - “ 

(294) 

Thus  the  eigenvalue  sensitivities  may  be  obtained  In  a natural  way  from 
the  other  computations  which  take  place  In  computing  the  parameter 
sensitivities  via  the  operator  method. 

3.  SUMMARY 

In  this  section  we  have  used  the  representation  of  Section  III  to 
obtain  a uew  computational  algorithm  for  quaslllnearlzatlon  parameter 
Identification  In  linear  time-invariant  ordinary  differential  equation 
systems.  This  Is  one  of  the  more  direct  computational  applications  and 
It  shows  how  the  sensitivity  system  differential  equations  may  be  re- 
placed by  the  solution  of  a relatively  low  number  of  quadrature  Integrals. 
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Furthermore,  the  methods  for  computing  the  eigenvalue  sensitivities  from 


blllty"  properties  (Section  IV. 3)  have  been  discussed  In  relation  to 


reducing  the  total  computational  requirements  for  this  approach 


In  Sections  VI  and  VII  we  consider  further  computational  applications 


by  applying  the  results  of  Section  III  to  minimum  sensitivity  control 


design  and  sensitivity  operator  design  for  optimal  parameter  Identlflca- 
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Section  VI 

MINIMUM  SENSITIVITY  CONTROL  DESIGN  FOR 
THE  LINEAR  TIME- INVARIANT  SYSTEM  Sj^^ 

In  Section  II. 6 a gradient  method  for  open-loop  minimum  sensitivity 
control  design  was  developed  for  the  Hilbert  space  linear  system  S^.  In 
this  section  we  apply  this  method  to  the  linear  time-invariant  ordinary 
differential  equation  system,  consider  this  as  a computational 

alternative  to  previously  employed  Rlccatl  equation  techniques  (e.g., 

Kahne  (Ref  70)).  Like  the  previous  section,  this  Illustrates  the  computa- 
tional features  of  the  matrix-operator  form  of  the  parameter  sensitivities 
which  Is  presented  In  Section  III  and  Appendix  A. 

1.  GRADIENT  METHOD  OP  SOLUTION 

To  Illustrate  some  Important  computational  aspects,  the  cost  func- 
tional of  Section  II. 5 Is  generalized  slightly  to  consider  minimization 
of 


JgCu)  - [Y(t^;u)/S^Y(t^;u)] 
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The  real  matrices  and  S(t)  are  assumed  to  be  nonnegative  definite  and 
symnetric,  and  the  real  matrix  0(t)  is  assumed  to  be  symmetric  and  uni- 
formly positive  definite.  The  latter  asstimption  is  required  to  ensure 
existence  of  a unique  minimizing  control.  We  note  that  the  cost  func- 
tional J-(u)  is  quite  general  as  it  weights  the  sensitivities  along  the 
s 

nominal  trajectory  as  well  as  at  the  terminal  time  t^.  Such  a cost  func- 
tional has  been  used  by  a number  of  other  researchers  in  the  minimum 
sensitivity  control  design  problem  (e.g.,  Kahne  (Ref  70)),  and  the  weight- 
ing matrices  may  be  manipulated  to  achieve  various  design  objectives. 

Following  the  approach  of  Section  II. 5,  we  may  separate  Y(t;u)  into 
its  zero-input  portion,  Y (t),  and  zero-state  portion,  Y (t;u). 

Then  using  the  notation  of  the  previous  section  which  is  defined  in  de- 
tail in  Appendix  A,  these  may  be  computed  from  the  matrix-operator 
relations 

Y,  . Ct)  - P f (t)  (297) 

Zele 

— ^f 

Y (t;u)  - G / ^ f(t  - s)u(s)ds.  (298) 

Z.S.  rt 


For  notatlonal  simplicity  we  will  assume  in  this  Section  that 
u(*)  e 12(0, t^:R)  is  a single  control  input  (that  is,  r > 1).  The 
results  are  easily  extended  to  the  multi-input  case.  By  substituting 
equations  (297)  - (298)  into  (295)  and  performing  an  Interchange  of  the 
order  of  integration,  the  cost  functional  ^^(u),  equation  (295),  may  be 
put  into  the  form 

J_(u)  - k + / ' Iu(t)/2h(t)  + P(t)u]dt  (299) 

* 0 


where 
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h(t)  . f^(t,  - t)H's,T^  , (t,) 

+ / * £^^(8  - t)l?  S(8)Y  . (8)d8 

^ z«  x* 

P(t)u  - O(t)uu)  + _ (t-;u) 

t Z Z.8«  t 

+ / ^ £^^(8  - t)H’^  S(8)Y  _ (8;u)d8 

. Z • 8* 


z • £• 


(t)]dt 


(300) 


(301) 


(302) 


The  quadratic  coat  £unctlonal  Jg(u)  may  then  be  minimized  by  gradient 
(or  conjugate  gradient)  Iterations  where  the  gradient  of  dg(u)  Is 

7Jg(t;u)  - 2(h(t)  + P(t)u)  (303) 

t e [0,tj] 

(See,  e.g.,  Luenberger  (Ref  20).) 

For  such  gradient  minimization  techniques,  tne  gradient  function  must 
be  computed  for  each  new  guess  of  the  optimal  control.  Therefore,  a major 
element  of  the  computation  time  for  such  an  approach  will  be  the  computa- 
tion of  the  gradient  function  Itself,  equation  (303).  Therefore,  we  dis- 
cuss the  computational  features  associated  with  computing  this  gradient 
function  from  equations  (301)  - (303). 

The  function  h(<)  must  be  computed  once  and  stored.  The  terminal 
sensitivity  term  (the  first  term  of  equation  (301))  Involves  no  Integrals 
but  merely  matrix  manipulations  and  the  determination  of  the  2n-dlmen8loned 
vector  function  £(*)  from  the  eigenvalues  of  A (see  Appendix  A for  details). 
The  trajectory  sensitivity  term  (the  second  term  of  (301))  requires  2n 
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p 


I 


Integrals  for  each  separate  control  input.  It  la  nost  Important  to  point 

out  that  these  Indicated  convolutions  nay  be  transformed  Into  quadratures 

hy  the  methods  discussed  In  Section  V (see  equation  (274)). 

Hext,  the  function  P(«)u  mast  be  recomputed  for  each  new  guess  of 

the  control  Input.  An  Inherent  part  of  the  computation  Is  obtaining 

T (*;u)  from  equation  (298).  As  discussed  In  Section  V,  this  also  re- 
z.s. 

quires  the  solution  of  2n  quadrature  Integrals  for  each  separate  control 

Input.  Once  computing  Y (*;u)  the  terminal  sensitivity  term  of  P(*)u 

z.s. 

(the  second  term  of  (302))  requires  only  matrix  operations  and  no  addi- 
tional Integrals.  However,  the  trajectory  sensitivity  term  (the  third 
term  of  (302))  again  requires  the  solution  of  2n  Integrals  for  each  con* 
trol  Input.  Again  it  Is  significant  to  note  that  the  indicated  convolu- 
tions may  be  transformed  Into  qtiadratures. 


To  summarize,  the  computational  requirements  for  computing  the 
gradient  function,  equation  (303),  are  quite  minimal  and  Increase  only 
linearly  with  state  dimension  n and  control  dimension  r;  the  computational 
requirements  are  virtually  Independent  of  the  parameter  dimension  p.  It 
Is  also  significant  to  point  out  that  the  terminal  sensitivity  terms 
Inherently  do  not  require  any  convolutions  to  be  solved;  therefore.  If 
the  trajectory  sensitivity  weighting,  S(«),  Is  Identically  zero,  then 
the  gradient  function  requires  no  convolutions  to  be  obtained  and  the 
gradient  method  may  readily  be  applied  to  other  more  general  linear  sys- 
tems (say,  e.g.,  the  linear  time-varying  system  of  Section  III.l  where 
Theorem  1 Is  used  to  compute  the  partial  derivatives  of  the  state  transi- 
tion matrix).  However,  If  S(*)  Is  ncn>zero  then  the  gradient  function 
7Jg(*;u)  requires  convolutions  to  be  computed  (see  the  second  and  third 
terms  respectively  of  equations  (301)  and  (302)),  and,  except  for  the 
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e«s*  of  th«  linear  ttae-lnvarlant  plant  matrix,  thcae  convolutions  cannot 
generally  be  transformed  Into  the  far  more  desirable  quadrature  Integral 
form.  Therefore,  for  the  general  case  with  this  operator-gradient  approach 
the  computational  differences  between  the  terminal  sensitivity  problem  and 
the  trajectory  sensitivity  problem  are  quite  large.  This  Is  not  true  of 
the  Blccatl  equation  exact  method  of  solution  examined  in  the  next  section, 
for  there  the  computational  requirements  are  virtually  the  same  for  both 
the  trajectory  sensitivity  and  terminal  sensitivity  cases. 

Another  Important  factor  In  the  computational  requirements  of  the 
gradient  method  of  solution  Is  the  accuracy  of  an  initial  guess  of  an 
optimal  control.  The  Initial  guess 

Ug(t)  - -0‘^(t)h(t)  (304) 

might  be  selected  for  this  control  minimizes  the  functional 

Iu(t)/2h(t)  + OCOuCt)]  (305) 

and  this  Is  an  approximation  to  the  sensitivity  cost  functional  Jg(u), 
equation  (299).  However,  If  the  optimal  control  for  one  set  of  condi- 
tions has  been  obtained  and  these  conditions  are  perturbed  slightly  (say, 
for  example.  If  a new  set  of  Initial  conditions  are  selected  or  If 
linearization  about  a new  nominal  trajectory  In  a quaslllnearizatlon 
nonlinear  system  optimization  Is  desired),  then  the  old  optlhal  control 
may  be  used  as  the  first  guess  with  the  new  conditions.  Since  the  quad- 
ratic functional  Jg(u)  Is  continuous.  If  the  new  conditions  are  not 
perturbed  greatly  from  the  old,  the  new  optimal  control  will  be  "close" 
to  the  old  optimal  and  convergence  should  be  quite  rapid.  In  general, 
this  Is  an  advantage  of  Iterative  solution  techniques  over  exact  methods 
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of  solution  In  that  prior  Information  such  as  this  may  be  utilized  to 
reduce  computational  requirements. 

Finally,  we  comment  that  the  gradient  type  approach  may  be  conven- 
iently applied  to  a number  of  variations  to  the  basic  quadratic  minimum 
sensitivity  control  problem  which  has  been  considered  here.  For  Instance, 
If  It  Is  required  that  the  system  output  at  the  terminal,  y(t^;u),  meet 
a terminal  equality  constraint^ 

y(t-;u)  - F^°^f (t,)  + / * f (t-  - t)u(t)dt 

* * 0 * 


(306) 


and  If  has  rank  m (l.e.,  the  system  Is  output  controllable),  then  the 

gradient  projection  method  of  Rosen  (Ref  24)  may  be  easily  applied  to 
solve  the  terminal  equality  constraint  mlnlimsB  sensitivity  control  prob- 
lem. (See  Luenberger  (Ref  20:  297  - 299)  for  the  computational  details 

of  this  method.)  Since  the  projection  operator 

r - 1 - f^(t-  - t)G^°^^(G^°yG^°^^  G^°V^^  f(t-  - t)(-)dt  (307) 

* C z 


«(t-  - t)f*(t.  - t)dt 
« 0 * * 


(308) 


used  In  this  method  Is  easily  calculated,  there  Is  very  little  computa- 
tional Increaie  for  solving  this  terminal  equality  constraint  problem. 

3.  RICCATI  EQUATION  SOLUTION 

In  the  previous  section  a gradient  approach  to  minimizing  the  quad- 


^See  Appendix  A for  the  definition  of  the  m x 2n  and  m x 2nr  dimensioned 
Mtrlces  f(0)  and  G'®^. 
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ratio  sensitivity  cost  functional,  Jg(u),  equation  (295),  was  examined. 

The  results  of  Section  III  and  Appendix  A were  applied  and  some  highly 
efficient  computational  features  were  obtained  for  the  time-invariant 
system  S^-.  However,  often  an  explicit  solution  to  the  open-loop  minimum 
sensitivity  control  problem  would  be  desired  rather  than  the  iterative 
methods  of  the  previous  section.  In  such  a case  a Rlccatl  equation  method 
of  solution  is  a well-known  technique  for  calculating  the  open-loop 
minimum  sensitivity  control  law  in  linear  systems.  (See,  e.g.,  Kahne 
(Ref  70).)  We  emphasize  the  fact  that  the  control  law  determined  in 
this  manner  is  only  the  open-loop  solution  and  cannot  be  made  closed-loop 
(except  by  approximation  - see,  e.g.,  Lamont  and  Kahne  (Ref  72))  as  is  the 
case  for  a Rlccatl  equation  solution  when  sensitivity  is  not  Included  in 
the  cost  functional.  This  fact  is  well-known  (e.g..  Price  and  Deyst 
(Ref  75))  and  stems  from  the  fact  that  the  optimal  control  is  computed 
from  a linear  combination  of  both  the  optimal  nominal  state  and  state 
sensitivities  (see  equation  (312)).  However,  the  sensitivities  are 
computed  under  the  assumption  that  the  control  is  open-loop,  and  so  the 
form  becomes  mathematically  Incorrect  when  used  in  a closed-loop  (feed- 
back) manner. 

Using  the  time-invariant  sensitivity  system  of  Section  III 


X(t;u)  - AX(t;u)  + B u(t) 

X(0)  - d 

(309) 

Y(t;u)  - CT(t;u) 

t t I0,t^] 

(310) 

(see  equations  (92)  - (94)),  the  cost  functional  dg(u),  equation  (295), 


ly  be  written  as 
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J^(u)  - [X(t^;u)/?S^CX(t^;u)] 

+ / ^ [X(t;u)/C^S(t)CX(t;u)]dt 
0 

+ / * [u(t)/0(t)u(t)]dt  (311) 

0 

In  this  form  the  minimum  sensitivity  control  problem  Is  a standard  linear 
system  quadratic  cost  functional  optimal  control  problem  which  has  the 
well-known  solution  given  by  (e.g.,  Bryson  and  Ho  (Ref  5)): 

u*(t)  - -(0(t))"^B’^(t)X*(t)  (312) 

where  R(t),  t e [0,t^]  Is  obtained  via  the  unique  solution  of  the 
n(p  + 1)  X n(p  + 1)  matrix  Rlccatl  differential  equation 

R(t)  - -R(t)A  - 7^  R(t)  + R(t)B  0"^(t)B^  R(t)  - ^S(t)c’  (313) 

with  terminal  boundary  condition 

R(t^)  - SjC.  (314) 

The  n(p  1)  augmented  vector  function  X (•)  Is  computed  by  Integrating 
forward  the  sensitivity  system  differential  equations  (309)  using  the 
optimal  control  computed  from  (312). 

Mow  let  us  review  the  computational  requirements  of  this  Rlccatl 
aquation  method  of  solution.  Without  reducing  the  dimension  of  X(*;u) 
(see  Section  IV. 3)  the  symmetric  Rlccatl  matrix,  R(*)>  will  have  dimen- 
sion n(p  -f  1)  X n(p  + 1)  and  so  the  number  of  differential  equations 
Increases  In  rough  proportion  to  the  square  of  n and  the  sqr-are  of  p. 
However,  If  the  methods  of  Section  IV. 3 (or  other  approaches  to  minimal 
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order  aensltlvlty  models  — see  (Refs  34  - 42))  are  appliedt  then  the 
dimension  of  X(«iu)  nay  be  reduced  toy^^^  2n(r  + 1)  (see  equations  (226)  - 
(228)).  Therefore,  for  linear  tlme>-lnvarlant  systems  the  computations 
for  the  Rlccatl  equation  approach  can  be  made  to  Increase  only  with  the 
squares  of  n and  r and  be  independent  of  the  parameter  dimension  p.  This 
Is  In  contrast  to  the  gradient  method  of  solution  where  the  amount  of 
computation  Increases  only  linearly  with  n and  r and  Is  also  independent 
of  the  parameter  dimension  p.  Thus  for  systems  with  large  state  dlmen'- 
slon,  n,  the  gradient  approach  offers  the  potential  of  considerable 
computational  reduction. 

Finally,  we  note  that  the  Rlccatl  method  of  solution  may  also  be 
utilized  In  cases  in  which  a terminal  output  condition  y(t^;u)  - y^ 
must  be  achieved  (see,  e.g.,  Bryson  and  Ho  (Ref  5:  158  - 163)). 

Bryson  and  Ho  describe  the  method  in  considerable  detail;  therefore,  we 
do  not  repeat  these  degtalls  here,  but  merely  comment  that  an  additional 
linear  matrix  differential  equation  of  dimension  n(p  + 1)  x n (equation 
(5.3.35) (Ref  5:  161))  must  be  solved  in  using  such  a technique. 

In  the  next  subsection  we  consider  a simple  second  order  example 
to  help  Illustrate  the  computational  features  of  the  operator-gradient 
method  and  also  point  out  the  computational  differences  between  Iterative 
gradient  method  and  the  Rlccatl  equation  exact  method  of  solution. 


3.  EXAMPLE 


Once  again  we  consider  the  second  order  system 
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with  nominal  parameter  vector  b 


was  considered  In  Sections  IV. 2,  IV. 3,  IV. 4,  and  V.2,  we  will  use  many 


of  the  results  from  those  sections  to  shorten  the  discussion  here 


As  described  In  Section  V.2  we  may  write  tne  zero-lnnut  and  zero' 


where  F,  H,  and  f(t)  are  given  by  equations  (285)  - (286).  Note  that  only 


four  quadrature  Integrals,  equation  (287),  are  required  In  computing 


We  will  consider  minimizing  both  a terminal  sensitivity  .cost  func' 


J,(u)  - r u(t)dt  + c,[Y(l;u)/Y(l;u)] 


and  the  trajectory  sensltlvl'-y  cost  functional 


J,(u)  - J,(u)  + c,  /^tY(t;u)/Y(t;u)]dt 


where  the  scalar  constants  c.  and  c,  may  be  assigned  various  weighting 


values.  The  gradient  of  the  terminal  sensitivity  cost  functional  may 
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7Jj(t;u)  - 2(hj(t)  + P^(t)u) 


(321) 


where 


h,(t)  - - t)^  . (1) 

1 1 z«i« 


(322) 


P,(t)u  - u(t)  + c,f*(l  - t)l?  Y^  ^ (l;u) 
X X z • 9 * 


(323) 


f'’^(l  - t)  - [e^"^  (l-t)e^‘^  (l-t)e^^*^"^^  ] 


(324) 


Notice  that  the  only  Integrals  required  to  compute  the  Pj^(t)u  term  are 
the  four  quadratures  required  in  computing  Y (t-.u)  from  eqtiatlon  (318). 

Next  consider  the  gradient  of  the  trajectory  sensitivity  cost  func- 
tional, equation  (320).  This  may  be  computed  from 


7J2(t;u)  - 2(h2(t)  + P2(t)u) 


(325) 


»d»ere 


h2(t)  - hj^(t)  + C2  f'^^Cs  - t)ff’^  Y^^^  (s)d8 


(326) 


P,(t)u  - P.(t)u  + c,  f''^(s  - t)i?  Y^  _ (8;u)d8 
Z X i ^ Z.8. 


(327) 


Note  that  we  may  work  with  f^(8  - t)  In  the  factored  form 


.T,  rt-s  t,  -8^  -Sv  2t  -28  2t,  -2s  ^ -2s. 


f (s  - t)  • [ 


e e e 


•/  "8  ..  “Sv  zc  -zs  zr,  -ZS  ^ -ZS.  T /,oo\ 

(se  -te  ) e e e (se  -te  ) J (328) 


In  order  to  transform  the  four  convolutions  of  equations  (326)  and  (327) 
Into  four  quadratures  each.  Thus,  for  this  trajectory  sensitivity  gradient 
there  ate  a total  of  four  quadratures  to  compute  the  Invariant  term  h2(*)> 
and  there  are  a total  of  eight  to  compute  the  term  P2(*)u.  Notice  that 


A 
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I 

i 

r 

[ 

I 

i 


this  number  of  Integrals  Is  only  dependent  upon  the  dimension  of  f(*) 
and  Is  Independent  of  the  number  of  parameter  components  or  upon  any 
special  "controllable"  form  of  Y (t;u)  (see  Section  IV. 3). 

Z e 8 • 

To  use  a gradient  method  In  computing  the  optimal  control  for 
minimizing  either  Jj^(u)  or  J^(u)  requires  that  we  start  with  an  Initial 
guess  of  the  optimal  control  (see,  e.g.,  Luenberger  (Ref  20:  285)).  We 

choose  the  Initial  guess  suggested  In  Section  VI. 1 as  follows: 

u^(t)  - -h^(t),  1 - 1 or  2;  (329) 

for,  this  Is  the  control  which  Is  the  unique  minimum  of  the  approximate 
cost  functional 

(u(t)  + h.(t))u(t)dt 
0 ^ 

Then  we  may  use,  for  example,  a standard  steepest  descent 
(e.g.,  Luenberger  (Ref  20:  285))  to  give  the  sequence  of 

estimates 

«j+l(t)  - Uj(t)  + kjrj(t)  (331) 

where  r^ (• ) Is  one-half  the  negative  gradienc 


(330) 

algorithm 
optimal  control 


\ 


rj(t)  - -(h(t)  + P(t)uj) 


(332) 


and 


" A 


/ r . (t)r, (t)dt 
0 J J 


/ r-(t)P(t)r.dt 
0 J J 


(333) 


Ths  sequence  Is  stopped  when  the  norm  of  fj(*)  sufficiently  small  or 
when  there  Is  no  longer  any  appreciable  decrease  In  the  cost  functional. 
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Notice  that  for  each  step  of  iterations  (except  the  first)  we  must 
tise  the  subroutine  P(t)u  just  one  time  since  we  may  write  P(t)Uj^j^  as 

P(t)Uj^j^  - P(t)Uj  + (334) 

and  both  P(t)Uj  and  P(t)rj  have  been  determined  from  the  previous  Uj 
step.  Thus,  In  this  example,  for  each  new  Iteration  of  the  control  there 
would  be  four  quadrature  Integrals  required  If  the  cost  functional  is 
J^(u)  or  there  would  be  eight  If  the  cost  functional  Is  J^Cu). 

The  above  mentioned  steepest  descent  algorithm  was  programmed  on 
the  CDC  6600  digital  computer.  Using  trapezoidal  integration  with  a step 
size  of  .002  on  the  interval  [0,  11,  the  optimal  minimum  sensitivity  con- 
trol law  was  computed.  The  step  size  requirement  is  a function  of  the 
accuracy  desired,  the  time  constants  of  the  system  (the  magnitude  of 
the  eigenvalues),  and  the  accuracy  of  the  Integration  scheme  selected. 

The  step  size  of  .002  was  chosen  In  order  to  give  reasonable  accuracy 
between  the  sensitivities  computed  using  the  operator  form  with  trape- 
zoidal Integration  and  those  computed  using  the  sensitivity  system 
differential  equations  with  a fourth  order  Runge  Kutta  integration. 
However,  far  more  accurate  integration  techniques  then  trapezoidal 
could  be  utilized  and  the  .002  Interval  could  then,  no  doubt,  be 
lengthened . 

To  Illustrate  the  Influence  of  the  cost  functional  upon  the  speed 

of  convergence,  the  scalar  constants  c^  and  C2  were  ass^ned  values  of 

(10,  1)  and  (100,  10).  The  opt5mal  control  for  both  the  cost  functionals 

J^(u),  equation  (318),  and  1^(0),  equation  (319),  were  computed.  The 

results  of  the  Iterations  are  shown  In  Tables  1 and  2.  Notice  that  for 

the  set  of  constants  (10,  1)  the  Initial  guess  u (t)  ■ -h. (t),  1»1,2,  Is 

o 1 
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fairly  close  to  the  optimal  solution,  and  the  optimal  control  Is  reached 


in  just  three  to  four  steps.  For  the  more  heavily  weighted  constants  of 


(100,  10),  the  Initial  guess  u^(t)  ■ -h^(t).,  1 ■ 1,  2,  Is  further  away 


from  the  optimal  solution  and  the  gradient  method  takes  longer  to  reach 
convergence.  Notice  that  for  the  set  of  weighting  constants  selected 
here,  the  speed  of  convergence  for  both  the  "terminal”  sensitivity  prob- 
lems, Jj^(u),  or  the  "trajectory"  sensitivity  problems,  J2(u),  are  roughly 
the  same. 

Next  consider  the  solution  of  this  example  by  the  Rlccatl  equation 
technique  described  In  Section  VI. 2.  For  this  example,  the  C matrix  Is 
the  Identity  matrix  and  so  Y(t;u)  “ X(t;u).  Also,  recall  from  the 
analysis  of  this  same  example  In  Section  IV. 3,  that  the  rank  of  the 
matrix  [E  G]  is  6;  therefore,  the  eight  elements  of  the  sensitivity 
system,  X(*;u),  may  be  calculated  from  a minimum  of  six  differential  equa- 
tions rather  than  eight.  Indeed,  from  the  results  of  the  example  of 
Section  IV. 3 we  have 


X(t;u) 


I 

0 0 2 0 

0 10  2 


3 1 

-2  0 


X^(t;u) 


8x6 


5 KX  (t;u^ 
c 


(335) 


where 


Xg(t;u) 


Xj^(t;u) 


^2 (t;u) 


(t;u) 


5^^^(t;u) 


.(2) 


(t;u) 


e^^^t;u) 


(336) 
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COMPUTATIONAL  RESULTS  OF  GRADIENT  METHOD 


Table  1:  c 


Table  2:  c.  ■ 100,  c 
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Nov  In  terns  of  X (t:u)  the  cost  functionals  J, (u)  and  J_(u)  nay  be 
c 1 Z 

written  as 


J,(u)  - u^(t)dt  + c,[X^a;u)/K'^ra  (l;u)l 
JjCu)  - JjCu)  + /^[X^(t;u)/K'^KX^(t;u)]dt 


(337 

(338 


The  six  elenents  of  the  "state"  X^(t:u)  are  then  detemlned  front 

X (t;u)  - A X (t;u)  + B u(t) 
c c c c 

X^(0;u)  - d^  t c [0,  1] 


(339 


(340 


where 


Is  uniquely  specified  by  the  equation 

u*(t)  - -1^  R2(t)XJ(t;up  t c [0,  1]  (342 

where  R2(t)  Is  obtained  by  Integrating  backwards  In  time  the  6x6 
dimensioned  matrix  Rlccatl  differential  equation 

R,(t)  - -R,(t)A  - R,(t)  + R^,(t)B^B^  R,(t)  - c A 


(343 
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with  terminal  boundary  condition 

¥2(1)  " 

The  optimal  "state"  trajectory,  X^Csu^)  Is  obtained  by  Integrating  for- 
ward In  time  the  equation  (339)  with  the  control  (342).  The  optima) 
control  which  minimizes  J^(u)  is  obtained  In  a similar  manner  except  the 

differential  eqiiatlon  for  R^(t),  equation  (343),  no  longer  has  the  last 

T t 

term,  C2K  K.  Notice  that  In  either  case  there  are  21  nonlinear  differen- 
tial equations  to  determine  the  symmetric  R^(«)  matrix  plus  an  additional 
six  linear  differential  eqiiatlons  to  determine  the  optimal  trajectory 
X*(’;u*).  Additionally,  six  function  elements  of  R^(*)  must  be  stored  In 
order  to  compute  the  optimal  control  from  equation  (342) . Finally,  the 
amount  of  computations  are  virtually  the  same  regardless  of  whether  the 
cost  functional  Is  J^(u)  or  J2(u).  This  Is  In  contrast  to  the  gradient 
method  In  which  the  amount  of  computation  was  doubled  for  the  trajectory 
sensitivity  cost  functional  J2(u)  (eight  quadratures  per  Iteration  versus 
four  for  the  terminal  sensitivity  cost  functional  J^(u)). 

3.  SUMMARY 

In  this  section  we  have  examined  computational  techniques  for  obtain- 
ing the  open-loop  minimum  sensitivity  control  law  for  the  linear  time- 
invariant  system,  Using  the  matrix-operator  form  of  the  parameter 

sensitivities  the  gradient  approach  developed  In  Section  II. 5 was  applied. 
This  proved  to  be  an  efficient  method  of  solution  In  which  the  amount  of 
; computation  (number  of  quadrature  Integrals)  grew  only  linearly  with  the 

state  dimension  n and  the  control  dimension  r.  A simple  second  order 
example  system  showed  that  the  computations  were  quite  straight-forward 
to  apply  and  that  convergence  on  the  digital  computer  could  be  quite  rapid. 
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Next,  this  operator-gradient  method  of  solution  was  compared  with 
well-known  Rlccatl  equation  solution  methods.  By  applying  sensitivity 
system  reduction  techniques  (Section  IV. 3)  the  amount  of  computations 
could  be  reduced  substantially.  However,  the  amount  of  computation 
(nimber  of  nonlinear  differential  equations  for  solution  of  the  matrix 
Rlccatl  differential  equation)  still  grew  In  proportion  to  the  square  of 
n and  r as  opposed  to  linearly  with  the  n and  r for  the  gradient  method 
of  solution.  Thus,  even  with  sensitivity  system  reduction  meth'ods,  the 
gradient  solution  offers  a practical  alternative  to  computing  the  open- 
loop  minimum  sensitivity  control  law. 

In  the  next  section  we  consider  our  final  computational  application 
of  the  matrix-operator  representation  of  the  parameter  sensitivities  In 
linear  tlme-lnvsrlant  ordinary  differential  equatlon< systems.  There  we 
examine  optimal  design  of  the  sensitivities  to  enhance  parameter  Identi- 
fication capability,  and  once  again  the  matrix-operator  form  provides  a 
useful  alternative  to  differential  Rlccatl  equation  solution  techniques. 
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Section  VII 

SENSITIVITY  DESIGN  FOR  OPTIMAL  IDENTIFICATION  IN  LINEAR 
TIME-INVARIANT  ORDINARY  DIFFERENTIAL  EQUATION  SYSTEMS,  Sj^ 

In  this  section  we  consider  the  computation  of  the  initial  condition 
vector  and  control  Input  which  optimizes  parameter  Identification  capa- 
bility from  the  zero- Input  and  zero-state  responses,  respectively.  The 
methods  presented  are  based  upon  the  discussion  of  Section  II. 8,  and  like 
the  previous  two  sections  we  apply  the  matrix-operator  form  for  the  param- 
eter sensitivities  In  linear  time-invariant  ordinary  differential  equation 
systems  (Section  III  and  Appendix  A)  In  order  to  obtain  efficient  computa- 
tional techniques  for  solution. 

The  concept  of  obtaining  an  optimal  Initial  condition  vector  Is  new; 
however,  the  approach  for  obtaining  the  optimal  Input  function  are  based 
upon  the  work  of  Mehra  (see  Section  I.l.e  and  Section  II. 8 for  discussion). 
Mehra  suggests  solving  the  optimal  control  Input  design  problem  via  a 
Raylelgh-Ritz-Galerkln  algorithm,  and  that  Is  the  method  which  we  employ 
here  for  our  operator  approach.  Therefore,  like  Section  V the  contribu- 
tion here  Is  to  demonstrate  how  the  matrix-operator  form  of  the  parameter 
sensitivities  can  provide  a useful  computational  alternative  to  previously 
employed  two-point  boundary  value/Rlccatl  equation  methods  of  solution 
(e.g.,  Stepner  and  Mehra  (Ref  120)  and  Gupta  and  Hall  (Ref  108)).  The 
computational  utility  of  the  matrix-operator  form  Is  further  enhanced 
through  the  use  of  Walsh  functions  as  the  orthogonal  set  of  basis  func- 
tions In  the  Raylelgh-Rltz-Galerkln  algorithm.  The  special  properties 
of  the  Walsh  functions  allow  us  to  substantially  reduce  the  number  of 
lnt'3gral8  required  for  solution.  This  method  of  reduction  with  the  Walsh 
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functions  should  have  useful  applications  to  other  aspects  of  linear 
systen  sloulatlon  and  control  design. 

Again  the  simple  second  order  linear  system  examined  In  Sections  IV^- 
VI  will  be  used  here  to  Illustrate  the  computational  implications  of  the 
operator  algorithms. 


1.  OPTIMIZATION  OF  THE  ZERO-INPUT  SENSITIVITIES 

For  the  system  S^  the  zero-input  cost  functional  (see  Section  II. 8. b) 
takes  the  form 


l.J-1 


? ky  (C*  (t) ) (i)Q(t)  (C«  (t)  ) jd]dt 

l,j-l 


(345) 


The  m X m matrix,  Q(t),  Is  assumed  to  be  symmetric,  positive  definite,  and 
uniformly  bounded  for  t e [0,t^].  It  may  be  assumed  that  Q~^(t)  Is  the 
covariance  matrix  of  a Gaussian  white  noise  measurement  process  (see  Sec- 
tion II. 6. d). 

Using  Theorem  4 and  defining  the  n x n positive,  symmetric  matrix 


2n  2n  p ^ _ t. 

Aj^  s £ E { E k^j(CA''"^)‘^j  / a^(t)Q(t)a^(t)dt(CA^“b^ 

*•1.  1 0 


ie«l  £■!  1,J«1 


we  may  write  (d)  as 

'^z.l. 


(d)  - td/A^  dl 
z.l.  z.l. 


(346) 


(347) 
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-TT' 


Computing  the  symmetric  matrix  A^  Involves  merely  matrix  manlpula- 

z • 1« 

tlons  and  solving  the  qtiadrature  Integrals 


/ ‘a^(t)Q(t)a,(t)dt 
0 ’’ 


(348) 


K*!*  2«  •••  2n 
I ■ 2n,  2n-l,  ...k 


Also,  after  solution  of  these  Integrals  one  time.  Iterative  adjustment 


to  the  weighting  constants,  l.J  ■ 1,  2,  ...p,  becomes  a simple  process 


since  these  Integrals  do  not  change.  . Therefore,  the  constants  may 


readily  be  adjusted  to  satisfy  various  optimization  criterion  (see  Section 
II. 8. a). 

According  to  the  development  of  Section  II. 8. b,  we  place  the  auxiliary 
constraint  on  the  Initial  condition  vector 


[d/d]  1 1 


(349) 


We  may  Interpret  this  constraint  as  providing  the  unit  vector  initial 
"direction"  of  maximum  Information.  Then  the  d*  e R**  which  maximizes 


'K 


(d)  Is  the  eigenvector  corresponding  to  the  maximum  eigenvalue  of 


z . 1 . 


the  positive,  symmetric  matrix,  A^  , and  normalized  so  that 

z . 1 . 


[d*/d*]  - 1. 


(350) 


Since  Is  positive  and  symmetric,  rapid  computer  techniques 

z.l. 

exist  for  obtaining  the  maximal  eigenvalue  and  corresponding  eigenvector. 


(See,  e.g.,  Fadeev  and  Fadeeva,  (Ref  11:  406  - 454).)  Also,  It  may  be 


desirable  to  have  an  optimization  algorithm  which  will  maximize  the 


functional  (345)  over  any  given  subspace  R . For  example,  the  Raylelgh- 
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Uti-Galcrkln  algorithm  described  In  detail  In  the  next  section  has  this 
particular  oropertyt  and  It  Is  easily  modified  to  solve  this  finite 
dimensional,  matrix  eigenvector  problem.  Then,  when  Implementing  a 
procedure  stich  as  suggested  In  Section  IV. 2 for  reducing  the  number  of 
Influencing  parameter  components  (see  the  discussion  following  Corollary 
S.l),  one  may  not  only  select  an  Initial  condition  vector  which  Is  from 
the  Insensitive  subspace  of  one  group  of  parameter  components,  but  he  may 
also  select  the  Initial  condition  vector  which  Is  optimum  with  respect  to 
Identifying  the  remaining  parameter  components. 

In  conclusion,  the  method  of  Section  II. 8 Is  well  suited  for  deter- 
mining an  optimal  Initial  condition  vector  for  the  system  S^.  Indeed, 
with  very  little  additional  computations,  the  method  may  be  applied  to 
find  the  optimal  Initial  condition  vector  for  the  time-varying  linear 
system,  S^.  Such  techniques  should  be  valiiable  xdien  the  system  deslgnei 
has  the  freedom  to  select  the  Initial  condition  vector,  and  he  Is 
attempting  to  obtain  the  greatest  "Information”  for  Identifying  the  sys- 
tem parameters  from  the  transient  response. 

Example 

Consider  the  second  order  homogeneous  linear  system 

t e (0,  1]  (351) 

with  the  single  observable  output 

(352) 

The  nominal  parameter  values  are  b^^  ■ -2  and  b2  " -3;  thus  the  nominal  A 


y(t)  - (1  0] 


X2(t) 

X2(t) 


X2(t) 

1 

o 

1 

X2(t) 

X2(t) 

1 

"2_ 

X2(t) 
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Mtrlx  is  the  same  as  the  examples  of  Sections  IV.  V,  and  VI  and  so  we 
will  use  some  of  the  previous  results  to  shorten  the  discussion  here. 
Wa  may  then  compute 


(353) 


(CA^)^j  - [1  0]  (CA^)(2)  “to  1] 


(354) 


'CA^)^^j  - [-3  1]  (CA^)(2)  “ t-2  -61 


(355) 


Assuming  Q(t)  > 1 for  all  t e [0,1]  and  that 


( 1 1-  J 
( 0 1 ^ J 


(356) 


we  then  obtain 


ic,l-l  l,j-l 


I CCA''’^)(l)CCA^“^)yj 


• I a (t)a  (t)dt 
0 ‘ 


a2(t)a2(t)dt 


[; :]  r 

.[■-  •■] . 

h-l  -12 J 0 

T”  ’1 

| 9 37 J 0 


aj(t)a^(t)dt 


a^(t)a^(t)dt 


(357) 
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From  the  Inverted  Vandermonde  matrix  of  the  example  In  Section  V,  equation 
(283),  we  may  determine  the  scalar  functlras  a^C*)  and  a^(«)  to  be 


a^Ct)  • -9e  ^ + 5te  *"  + 9e  + 4te 


a^(t)  ■ -2e  * + te  ^ + 2e  + te 


(358) 


(359) 


Then 


r a,(t)a,(t)  = .022 
0 ^ ^ 


r = .0026 


a^(t)a^(t)  : .0003. 


Using  these  values  gives  the  approximate  value 


(360) 


(361) 


(362) 


2.1. 


.010  0 

0 .002 


(363) 


Thus  ■ .01  Is  the  largest  eigenvalue  of  A^^  , and  the  eigenvector 

z . 1 . 

corresponding  to  this  eigenvalue  Is  8jj  ■ [1  0]  Hence  the  optimal 

Initial  condition  vector  for  Identifying  the  parameters  in  the  system 
(351)  - (352)  Is  one  selected  In  the  direction  6^^. 

2.  OPTIMIZATION  OP  THE  ZERO-STATE  SENSITIVITIES 

For  simplicity.  In  this  section  we  will  assume  a single  control  Input 
(that  Is,  r ■ 1)  and  that  the  weighting  matrix  Q Is  t Ime- Invar lant . The 
extensions  to  the  more  general  cases  are  easily  made.  Then  for  the  sys- 
tem S^^,  the  zero-state  Input  design  cost  functional  (see  Section  II. 8. c) 
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taka*  th*  fora 


Ualng  the  results  of  Section  III  and  Appendix  A,  the  zero-state  sensitiv- 


ities nay  be  computed  from  the  relations 


/ f(t  - 8)u(s)ds 


inhere  f(*)  Is  a 2n-dlmensloned  vector  function  defined  In  Appendix  A and 
which  has  elements  of  the  form  f'e  for  real  eigenvalue  of  A,  q.  ; and 


p,  are  m x 2n  dimensioned  matrices  computed  as  des 


crlbec  In  Appendix  A.  Using  this  relationship  (365)  and  Interchanging 


the  order  of  Integration,  we  may  put  the  cost  functional  (364)  Into  the 


Iu(t)/Aj,  (t)u]dt 


irtiere  we  define  the  positive  self-adjoint  operator 


C)u(i;)dc)db 


Assume  that  the  control  must  satisfy  the  "energy"  constraint 


See  footnote  of  Section  II. 8. c,  page  51 
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/ ' [u(t)/u(t)]dt  < 1.  (368) 

0 


Then  this  problem  Is  a complete  parallel  to  that  of  Section  VII. 1.  There- 
fore, the  optimal  Input  function  which  maximizes  the  cost  functional  (366) 
subject  to  the  constraint  (368)  Is  the  eigenfunction  of  the  positive,  self- 
adjoint,  compact  operator,  A^  , which  corresponds  to  Its  maximal  elgen- 


z.s. 


value. 


This  conclusion  Is  a restatement  of  the  results  of  Mehra  (Ref  112) , 
and  the  motivation  for  our  approach,  both  In  this  section  and  the  previous 
one.  Is  based  entirely  upon  that  worR.  However,  the  contribution  which  we 
wish  to  make  here  Is  to  use  the  new  matrix-operator  representation  of  the 
parameter  sensitivities,  equation  (365),  to  form  an  efficient  computational 
algorithm  for  computing  the  optimal  Input  function.  This  algorithm  is 
developed  In  the  next  subsection,  and  then  It  Is  compared  with  previously 
employed  Rlccatl  equation  techniques  In  the  Subsection  VII. 2. b. 


a.  Rayleigh-Ritz-Galerkln  Algorithm  for  Computing  the  Maximal 
Eigenfunction 

Mehra  (Ref  112)  suggests  solving  the  optimal  Input-maximal  eigenfunc- 
tion problem  by  means  of  a Raylelgh-Rltz-Galerkln  method  of  solution; 
however,  he  does  not  present  the  computational  details  for  this  method. 

The  Rayleigh-Rltz-Galerkln  method  Is  the  solution  technique  which  we  elect 
to  adopt  using  the  matrix-operator  form  of  the  parameter  sensitivities, 
equation  (365).  This  method  has  the  desirable  property  that  one  may 
select  "basis"  functions  which  are  easy  to  Implement  once  the  optimal 
control  function  Is  determined  as  linear  combinations  basis  functions. 

For  example,  either  sinusoids  or  Walsh  functions  (Ref  126)  night  be 
utilized  as  the  orthogonal  set  of  basis  functions  and  a linear  combination 
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of  either  of  these  might  be  easy  to  Implement  In  a physical  control  sys- 
tem. On  the  other  hand,  if  the  optimal  Input,  u*.  Is  determined  as  an 
arbitrary  element  of  L2(0,t^;R),  It  may  be  difficult  to  Implement  without 
approximation.  Also  In  using  the  Raylelgh-Rltz-Galerkln  method,  the  sys- 
tem designer  has  the  freedom  to  select  any  level  of  complexity  (and 
accuracy)  of  the  optimal  input  by  choosing  the  number  of  basis  functions 
over  which  he  optimizes. 

The  details  of  the  Raylelgh-Rltz-Galerkln  method  of  solution  are  now 
stated.  Notice  that  in  using  this  technique,  we  utilize  equation  (364) 
rather  than  (366),  as  this  form  proves  to  be  less  sensitive  to  numerical 
errors,  has  slightly  less  total  number  of  Integrals,  and  may  easily  be 
extended  to  time-varying  linear  systems  by  computation  of  the  parameter 
sensitivities  from  the  "sensitivity  system". 


Raylelgh-Rltz-Galerkln  Algorithm 

1)  Given  the  nominal  A matrix,  compute  Its  distinct  eigenvalues 
k ■ 1,  2,  ...p,  and  the  multiplicities,  n^^,  of  each  eigenvalue  In 

the  characteristic  polynomial  of  A. 

2)  Using  the  eigenvalues  of  A,  form  and  Invert  the  2n  x 2n 

Vandermonde  matrix  A as  defined  in  Appendix  A.  Using  the  elements  of 
A”^  and  the  quantities  A,  B,  C,  ^(1)*  ^ 2.  ...p, 

compute  the  p m x 2n  dimensioned  matrices^ 


- [<CB)^j  ^^^(1)  •••  (CA^"'^B)^j]A 


(369) 


^Note  that  this  equation  holds  because  r ■■  1;  If  r were  greater  than  one 
(more  than  one  control  Input)  then  a similar  equation  would  apply  to  each 
column  vector  of  B. 
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3)  For  Che  set  of  weighting  constants  k.  i,j  * 1,  2,  ...p,  and 

!■»  J 

the  m X m positive-definite  symmetric  matrix  Q,  compute  the  2n  x 2n 
dimensioned  matrix 


l.J-1 


if  J 


(370) 


4)  For  an  arbitrary  input  function,  u e L2(0,t^;R),  form  a subroutine 


to  compute  the  2n-dlmensloned  vector  functions  of  t e [O.t^l 

4(t;u)  i f(t  - s)u(s}ds 
0 


(371) 


where  the  2n-dimensloned  vector  functions  are  defined  in  Appendix  A and 
have  elements  of  the  form  f'e  for  real  eigenvalue  q,  of  A (or 

4 Ofct  , Oj^t  ^ 

f^e  cosio^t  and  f'e  slnw^t  for  complex  eigenvalues  “ ®k  — 

It  is  important  to  point  out  that  the  indicated  2n  convolution  integrals 
of  equation  (371)  may  be  computed  from  2n  quadratures  through  the  rela- 
tions such  as 


ft  t ’k'  -V  V -’k* 

(t  - 8)e  ■ te  e - e se 


(372) 


Comment 


At  this  point  we  are  prepared  to  begin  the  actual  Rayleigh-Rltz- 
Galerkln  optimization.  It  is  assumed  that  a set  of  n^  basis  functions, 
“j(t)t  J “ 1,  2,  ‘..nj^,  t e t0,t^]  are  prespecified.  For  computational 
simplicity  it  is  assumed  that  these  basis  functions  are  selected 
mutually  orthogonal;  chat  is,  it  is  assumed  Chat 


/ ‘ Uj  (t)Uj^(t)dt 


■r 

( 0 J ^ 


(373) 
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S)  Compute  and  store  the  n^^  x real  symmetric  matrix,  V^,  with 


row  and  column  element  given  by 


Vp(K,l) 


? k A[v^^^t;u  )/Qv^^\t;u.)ldt 

i»j  0 ^ 

t,J-l  ° 


■ f ^[♦(t;u  )/Q  ((>(t;u  )]dt 
0 ic  H t 


(374) 


6)  Compute  the  maximal  eigenvalue,  and  Its  associated* normalized 
eigenvector  for  the  positive  symmetric  matrix  Vp.  Letting  6jj(J)  denote 
the  component  of  6^^,  the  optimizing  control  input  la  then  given  by 

“b 

u"(t)  - Z Bjja)Uj(t)  (375) 

J-1 


and  the  optimal  value  of  the  cost  functional, 
given  by 


J 


K 


z.s 


(u),  equation  (364), 


Is 


(u*) 


”b 

L 6„(3)VpCj,J) 

J-1 


(376) 


Stop. 

Comment  1 Computational  Aspects 

The  preceding  algorithm  is  well  suited  to  digital  computer  applica- 
tions. In  fact,  for  a single  control  Inpr.t  (r  1),  th»  total  number  of 
Integrals  to  be  computed  are,  at  most: 

1)  2nn^  to  compute  the  n^^  vector  functions 


^(b}Uj),  J ■ 1,  2,  • • .n^,  t c [0,t£] 
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(•M  •quatlon  (371)).  Note  thet  depending  upon  the  heels  functions,  u^, 
these  Integrals  may  In  fact  he  computed  analytically. 

l)/2  to  compute  the  symmetric  matrix  (sec  equation 

(374)). 

The  remaining  computations  of  the  algorithm  are  straight-forward  matrix 
operations.  Also,  In  the  example  of  Section  VII. 2. c we  will  see  that  the 
special  properties  of  Walsh  basis  functions  allows  us  to  reduce  the  number 
of  required  Integrals  In  computing  the  functions  4(*:Uj),  j ~ 1^  2,  ...n^, 
to  2n.  If  n^  Is  large  then  this  Is  a significant  reduction  In 
the  amount  of  computation  and  storage. 


from 


2nn^ 


Comment  2 Change  of  Weighting  Constants, 

In  Section  II. 8. a we  discussed  the  fact  that  It  is  often  Important 

to  Iteratively  adjust  the  weighting  constants,  bo  that  the  optimal 

Input  maximizing  the  weighted  trace  cost  functional,  (u),  equation 

^z.s. 

(364),  more  closely  apjiroxlmates  minimization  of  either  the  determinant 
or  trace  of  the  Inverse  Information  matrix.  (See,  e.g.,  Mehra  (Ref  114) 
(Ref  115)  or  Gupta  and  Hall  (Ref  108).)  Therefore,  It  Is  Important  to 


point  out  that  once  the  Inn^^  Integrals 


f(t  - s)u.(s)ds  (377) 

0 J 


j - 1,  2,  ...n^j  t e [0,tjl 

of  equation  (371)  have  been  computed,  they  are  not  affeeped  by  adjustment 
of  the  constants.  Therefore,  after  the  first  Iteration  the  amount  of 
computations  are  significantly  reduced  for  subsequent  adjustments  of  the 
weighting  constants,  k... 
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Comment  3 Time- Varying  Matrices 

The  algorithm  Is  easily  modified  If  either  the  output  C matrix  or 
input  B matrix  are  time-varying;  the  computations  are  essentially  the 
same  except  that  the  Indicated  Integrals  must  Include  the  time-varying 
elements  of  these  matrices.  However,  if  the  system  plant  matrix.  A,  Is 
time-varying  then  the  convenient  matrix-operator  form  (equation  (365))  no 
longer  applies.  In  this  case  Theorem  1 could  be  utilized,  but  It  Is  prob- 
ably computationally  more  efficient  to  compute  the  parameter  sensitivities, 

v^^^  (t;u,),  required  In  equation  (374),  from  the  sensitivity-system 

z.s.  J 

differential  equations  (89)  - (91).  Indeed,  this  approach  could  also 
be  applied  for  the  time-invariant  system  and  controllability  properties 
of  the  sensitivity  system  (see  Section  IV. 3)  could  then  be  used  to  reduce 
the  required  number  of  differential  equations. 

b.  Solution  via  the  Maximum  Principle 

As  previously  started  the  theory  of  the  Input  design  problem  discussed 
In  Section  VII. 2 was  developed  by  Mehra  (Ref  112).  In  that  paper  he 
reformulates  the  operator/elgenfunctlon  problem  Into  a differential  equa- 
tion two-point  boundary  value  problem  through  the  use  of  the  "sensitivity 
system"  and  the  Maximum  Principle  (Ref  22) . For  the  control  constraint 
[u,u]  ^ 1,  he  shows  that  the  optimal  Input  satisfies  the  two-point 
boundary  value  problem 


d 

dt 

X(t;u) 

X(t) 

m 

— — T 

A -oBB 

— T — -T 

C QC  -A^ 

X(t;u) 

X(t) 

— - 

_ .. 

r-  -r 

s H 


X(t;u) 

A(t) 


(378) 
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X(0;u)  - 0 


X(t.)  - 0 

(»  r 


u*(t)  - -o*B’^X(t) 


(379) 


(380) 


vhere  X(0  is  an  n(p  + 1)  dimensioned  Lagrange  multiplier  function;  Q is 
given  by 


Q 0 
0 Q 


. 0 


• • • 


(381) 


_|  m(p+l)xm(p+l) 


and  X(t;u),  A,  B,  and  C are  defined  by  expressions  (92)  - (94)  of  Section 
III.  The  smallest  value  of  constant  a for  a nontrivial  solution  to  (378)  - 
(380)  gives  the  optimal  control  input  computed  from  equation  (380) . Mehra 
also  shows  that  the  optimum  a*  is  Inversely  proportional  to  the  optimum 
value  of  the  cost  functional  (trace  of  the  weighted  local  information 
matrix) . 

Mehra  (Ref  112)  suggests  solving  this  two-point  boundary  value  prob- 
lem by  means  ot  a Rlccatl  equation  method,  and  in  Reference  120  Stepner 
and  Mehra  present  a detailed  computational  algorithm  using  this  technique. 
However,  like  the  Rlccatl  equation  method  used  in  Section  VI. 3 for  minimum 
sensitivity  control  design,  this  approach  requires  the  precise  integration 
of  the  nonlinear  matrix  Rlccatl  equation  of  order  n(p  + 1)  x n(p  1). 

This,  coupled  with  other  computational  problems,  makes  the  Stepner  and 
Mehra  algorithm  somewhat  less  than  satisfactory,  (Ref  108).  In  more  recent 
work  Gupta  and  Hall  (Ref  108)  have  used  an  eigenvalue-eigenvector  decomposi- 


tion of  the  Hamiltonian  matrix,  H,  to  solve  the  two-point  boundary  value 
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problem,  and  thereby  they  avoid  integration  of  the  nonlinear  Riceatl 
differential  equations.  This  technique,  coupled  with  taking  the  controll- 
ability properties  into  account  to  eliminate  redundant  calculations  in  the 
sensitivity  system  (see  Section  IV. 3)  makes  the  total  computational  require- 
ments of  the  Gupta  and  Hall  algorithm  roughly  equivalent  to  those  of  the 
algorithm  of  Section  VII. 2. a.  However,  there  are  some  other  important 
differences  between  the  two  methods  of  solution. 

The  first  difference  is  that  Instead  of  specifying  the  time  Interval 
of  observation  [0,  t^]  (as  is  the  case  In  our  algorithm),  the  two-point 
boundary  value  method  specifies  the  constant  value  a'*^.  In  essence,  this 
specifies  the  optimal  value  of  the  cost  functional  (trace  of  the  weighted 
Information  matrix),  and  the  resulting  optimal  control  and  time  Interval 
of  observation  [0,  t|]  are  computed  from  this  constant  value  a*.  If  this 
Is  not  the  desired  time  Interval,  then  o*  must  be  adjusted  and  the  problem 
recomputed.  In  certain  cases  It  may  be  convenient  to  specify  the  level  of 
the  "Information"  required  from  an  experiment  and  the  time  Interval  of 
observation  fixed  according  to  this  level;  however.  In  other  cases  the 
time  Interval  of  observation  must  be  fixed  and  one  must  use  whatever  Informa- 
tion Is  obtainable  on  this  interval.  In  the  first  Instance  the  algorithm 
of  Gupta  and  Hall  might  be  the  more  direct  while  In  the  latter  case  the 
algorithm  of  Section  VI. 2. a might  be  more  straight-forward  to  utilize; 
however,  in  either  case  the  dependent  parameter  may  be  adjusted  by  suit- 
able Iteration  of  the  Independent  parameter. 

The  second  difference  Is  simply  one  of  Implementing  an  exact  optimal 
control  function  as  obtained  by  the  Gupta  and  Hall  method  versus  the 
Implementation  of  control  law  which  Is  formed  by  the  optimal  linear 
combination  of  preselected  basis  functions.  It  was  mentioned  previously 
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that  the  basis  functions  used  In  the  Rayleigh-Rltz-Galerkln  algorithm 
might  be  selected  with  some  physical  constraints  or  other  considerations 
In  mind,  and  this  might  facilitate  the  actual  Implementation  of  the  con- 
trol law.  Again  this  difference  Is  not  of  major  Importance  because  the 
"exact”  optimal  control  may  be  approximated  on  the  one  hand,  and  any  level 
of  "optimality"  may  be  achieved,  on  the  other  hand,  simply  by  Increasing 
the  number  of  basis  functions  over  which  the  optimization  Is  performed. 

The  final  major  difference  Is  In  the  amount  of  computations  required 
to  recompute  a new  optimal  Input  once  the  weighting  constants  are 
adjusted.  In  the  Gupta  and  Hall  algorithm  the  amount  of  computation 
remains  the  same  for  each  new  set  of  weightings,  whereas  the  amount  of 
computation  via  the  algorithm  of  Section  VII. 2. a Is  significantly  reduced 
once  the  first  optimal  value  has  been  obtained  (see  Comment  2 of  Section 
VII. 2. a).  Since  It  Is  often  quite  important  to  make  such  adjustments  to 
the  weighting  constants  (see,  e.g.,  Mehra  (Ref  114)  (Ref  115)  or  Gupta 
and  Hall  (Ref  108)),  this  Is  a significant  difference  between  the  two 


\ 


algorithms. 

c.  Example 

To  Illustrate  the  use  of  the  Raylelgh-Ritz-Galerkin  algorithm  bf 
Section  VII. 2. a,  we  once  again  consider  the  second  order  time- Invariant 
system: 


Xj^(t;u) 

X2(t;u) 


1 

b. 


Xj^(t;u) 

*2 (t;u) 


u(t) 


(382) 


yi(t;u) 

y2(t;u) 


1 0 
0 1 


Xj^(t;u) 

X2(t;u) 


t € [0,  8] 


(383) 
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with  sero  initial  conditions.  Also,  in  order  to  Illustrate  one  time  the 

eonputstional  features  associated  with  complex  eigenvalues)  ve  will  here 

T 

assume  that  the  nominal  parameter  vector  is  b • [-3  -2  1]  . There- 


fore, the  nominal  eigenvalues  are 


Then  the  generalized  Vandermonde  matrix.  A,  described  In  Appendix  A 


and  the  vector  function  f(t)  Is 
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Since  there  is  a single  control  input,  the  matrices  H 


are  computed  from  the  expression 


Then  assuming  that  Q is  the  identity  matrix  and  that 


This  matrix  was  computed  on  the  digital  computer  to  be 


With  these  preliminaries  we  are  ready  to  begin  the  Raylelgh-Rltz- 
Galerkin  optimization.  To  do  so  requires  that  we  select  a set  of  orthogonal 
basis  functions,  {Uj(*)  e L2(0,8;R);  J - 1,  2,  over  which  we  desire 

to  optimize.  For  example,  sinusoids  could  be  selected  and  such  functions 
have  found  frequent  application  in  parameter  identification  problems. 
However,  it  will  be  particularly  illuminating  to  select  an  orthogonal  set 
of  Valsh  functions  (Ref  126))  as  our  optimization  set.  We  will  see  that 
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th«  ■p«cl«l  properties  of  the  Welsh  functions  enebles  us  to  reduce  the 
overell  computetlonal  requirements  substentielly.  Indeed,  the  method  by 
uhleh  ve  reduce  the  computetlons  is  useful  in  its  own  right,  end  should 
contribute  to  already  existing  methods  for  utilizing  Walsh  functions  in 
system  simulation  and  control  function  design.  (See,  e.g.,  Chen  and 
Hsiao  (Ref  123)  (Ref  124)  and  Corrlngton  (Ref  125)  where  Walsh  functions 
are  used  to  approximate  the  solution  of  differential  equation  representa- 
tions of  dynamic  systems.) 

Therefore,  we  choose  as  our  basis  set  the  first  sixteen  Walsh  func- 
tions. These  may  be  defined  by  the  following  16  x 16  dimensioned 
symmetric  matrix: 

1111111111111111 
1 1 1 1 1 1 1 1 -1  -1  -1  -1  -1  -1  -1  -1 

1 1 1 1 -1  -1  -1  -1  -1  -1  -1  -1  I 1 1 1 

1 1 1 1 -1  -1  -1  -1  1 1 1 1 -1  -1  -1  -1 

1 1 -1  -1  -1  -1  1 1 1 1 -1  -1  -1  -1  1 1 

1 1 -1  -1  -1  -1  1 1-1-1  1 1 1 1-1-1 

1 1-1-1  1 1 -1  -1  -1  -1  1 1-1-1  1 1 

1 1-1-1  1 1-1-1  1 1-1-1  1 1-1-1 

*a  " 1-1-1  1 1-1-1  1 1-1-1  1 1-1-1  1 

1-1-1  1 1-1-1  1-1  1 1-1-1  1 1-1 

1-1-1  1-1  1 1-1-1  1 1-1  1-1-1  1 

1-1-1  1-1  1 1-1  1-1-1  1-1  1 1-1 

1-1  1-1-1  1-1  1 1-1  1-1-1  1-1  1 

1-1  1-1-1  1-1  1-1  1-1  1 1 -l”  1 -1 

1-1  1-1  1-1  1-1-1  1-1  1-1  1-1  1 

1-1  1-1  1-1  1-1  1-1  1-1  r -1  1-1 

(392) 

Each  of  the  column  vectors  of  this  16  x 16  matrix  corresponds  to  one 
Walsh  function,  and  each  row  corresponds  to  the  function  values  on  the 
corresponding  one-half  second  Interval  of  time. 
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Nov  let  Wj  - [Wj(l)  Wj^2)  ...  Wj(16)l^  be  the  column  vector  of 
end  define  the  16  x 1 dimensioned  vectors 


1/2  ^ ® T 

- [/  ■'■  •••  •'“  • 

* 0 * 1/2  * 15/2  *■ 

k ■>  1,  2,  3,  4 (393) 

where  f^(*).  k - 1,  2,  3,  4,  are  the  four  elements  of  f(').  equation  (387). 
Notice  that  each  of  the  Integrals  In  0^  may  be  computed  analytically.  In 
order  to  compute  the  optimal  solution  via  the  Raylelgh-Rltz-Galerkln 
method  of  solution,  we  must  compute  the  vector  functions  of  t c [0,  8] 


♦(t;u.)  • / f(t  - s)u, (s)ds 


J 


3 


(394) 


where  Uj(*)  e 12(0.  8;  R),  J - 1,  2,  ...16,  are  the  sixteen  Walsh  basis 
functions.  Using  the  relation 


f(t-s)  - e 


-t 


— 

cob/T  t 8ln/2  t 0 0 

e"  C08»^  8 

slni^  t “COs/?  t 0 0 

e*  slnx^  8 

t cos/?  t t 8ln>^  t cos/?  t sin/?  t 

-8  e*  cosi^  s 

t sln/^"  t -t  cos/T  t sln/T  t -cosn^  t 

-8  e*  8ln/2  8 

5 f(t)f(-8) 


(395) 


we  may  then  compute  the  vector  functions  ^(su^)  e L2(0,^8;  R ), 
j ■ 1,  2,  ...16,  from  the  relations 
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♦(tju.)  - /*  f(t  - ■)u  (s)d8 

J 0 ^ 

■ T(t)  f(-8)u. (8)d8 
0 ^ 

whcr*  the  element  of  thle  Integral  la  computed  from 


(396) 


f.  (-8)u  (8)d8 
0 * J 


L 

£ 

1-0 


,(i  + i)o^(i  + 1)  + w,a  + 1)  / 


j 


L/2 


fj^(-8)d8 


(397) 


Therefore,  to  compute  all  n^ 
It  now  only  requlrea  the  2n  < 


' 16  vector  functlona  6(t;Uj),  t c [0,  8] 
4 Integrala  over  the  Interval  (0,  8]: 


9 


f.(-B)de  t e [L/2,  L/2  + 1/2] 

L/2  ^ 

L * Of  Xy  e e e 15 

k - 1,  2,  3,  4 (398) 

That  la,  becauae  ve  are  dealing  with  Walah  functlona  (functlona  which  are 
either  -fl  or  -1  over  finite  Intervale),  and  becauae  we  are  able  to  write 
f(t  - a)  In  the  factored  form  Y(t)f(-8),  the  Integrala  In  computing  the 
functlona  4(t:uj),  J - 1,  2,  ...16,  are  Independent  of  apeclflc  u^. 
furthermore,  the  Integrala  (398)  may  all  be  computed  analytically.  Thua 
while  there  would  normally  be  2nn^  - 64  Integrala  required  to  compute  the 
16  Individual  vector  functlona  6(t;uj),  there  are  now  only  ^ - 4 Integrals* 
required  and  theae  may  all  be  computed  analytically.  Thla  la  quite  a aub- 
atantlal  reduction  In  computation  and  atorage  which  la  asaoclated  with  the 
uae  of  Valah  functlona  In  the  Raylelgh-RltZ'^lerkln  optimal  Input  design 
algorithm.  We  also  conent  that  thla  same  operator  technique  could  be 
applied  to  other  simulation  and  control  design  problems  which  are  associated 
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with  the  use  of  Walsh  functions  with  linear  time- Invariant  systema. 

(See,  e.g.,  Chen  and  Hsiao  (Ref  123)  (Ref  124)  and  Corrlngton  (Ref  123)). 

To  complete  the  Raylelgh-Rltz-Galerkln.  algorithm  requires  that  we 
compute  the  symmetric  16  x 16  dimensioned  matrix,  V^,  equation  (374),  with 
1^^  row  and  column  element 

8 

VQ(l,j)  • / [4'(t;u^)/Qg(|>(t;u^)]dt  (399) 

This  requires  + l)/2  « 136  Integrals  which  cannot  be  reduced.  Once 

computing  this  matrix,  we  then  determine  Its  maximal  eigenvalue  and 
corresponding  normalized  eigenvector,  c R^^.  The  optimal  Input  Is 
then  computed  to  be 

16 

u*(t)  - I 6jj(j)Uj(t),  t e [0.  8]  (400) 

J-1 

and  will  have  norm  equal  to  8.  This  problem  was  mechanized  on  the 
CDC  6600  digital  computer,  and  the  resulting  computed  optimal  Input  Is 
shown  In  Figure  1. 

3.  SUMMARY 

In  this  section  we  have  applied  the  algebraic  representation  of  the 
sensitivity  operators  for  the  system  (Theorem  4 and  Appendix  A)  to 
the  determination  of  the  optimal  Initial  condition  vector  and  control 
Input  function  for  Improved  parameter  Identification.  The  methods  used 
are  based  upon  the  discussion  of  Section  II. 3 for  the  Hilbert  apace  sys- 
tem S„.  Like  the  previous  Section  VI,  the  results  are  an  excellent 
n 

Illustration  of  the  computational  utility  of  the  operator  representation 
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I and  approach.  For  optimal  control  Input  design  the  Raylelgh-Rltz-Galerkln 

k 

method  Is  the  selected  computational  procedure,  and  this  Is  shown  to  be  a 

▼lable  alternative  to  previously  employed  Rlccatl  equation  techniques. 

r 

Finally,  if  Walsh  functions  are  selected  as  the  set  of  basis  functions  for 
the  Raylelgh-Rltz-Galerkln  uptlmlzatlon,  then  considerable  computational 
reductions  are  derived.  These  reduction  methods  should  also  have  general 
application  to  the  use  of  Walsh  functions  for  system  simulation  and  control 
design  In  linear  time-invariant  systems. 
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Section  VIII 


SUMMARY  AND  RECOMMENDATIONS  FOR  FUTURE  WORK 


In  this  section  we  review  the  major  objectives  and  contributions  of 
this  research,  and  then  discuss  some  potential  areas  of  extension  and 
future  research. 

1.  OBJECTIVES  AND  CONTRIBUTIONS 

There  were  two  major  objectives  in  this  research  and  presentation. 
The  first  was  to  demonstrate  the  useful  way  in  which  an  operator  approach 
could  help  understand  and  Investigate  system  properties  of  parameter 
sensitivity  In  linear  dynamic  systems.  The  sensitivities  themselves  are 
linear  operators  (Frechet  derivatives)  and  so  this  is  a natural  way  in 
which  to  treat  them.  In  Section  II  the  tools  of  functional  analysis 
were  applied  to  Investigate  many  of  theli  system  properties  for  a general 
Hilbert  space  linear  system.  Of  particular  prominence  in  this  analysis 
was  the  adjoint  operator  of  the  sensitivity  operators;  the  adjoint 
operator  played  a major  role  In  every  aspect  of  sensitivity  theory  which 
was  discussed  In  Section  II.  Section  II,  then,  laid  the  ground  work  for 
this  operator  approach  to  treating  the  parameter  sensitivities  and  estab- 
lished Important  methodology  for  application  to  linear  ordinary  differen- 
tial equation  systems  In  Sections  III  - VII. 

The  second  objective  was  to  use  this  operator  approach  In  the 
development  of  practical  computational  algorithms  for  application  to 
simulation,  analysis.  Identification,  and  control  of  linear  ordinary 
differential  equation  systems.  It  Is  felt  that  the  new  matrix-operator 
description  of  the  parameter  sensitivities  In  linear  time- Invariant  sys- 
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tens  (Theorems  2-4)  developed  In  Section  III  contributed  to  realizing 
this  secono  objective.  This  form  provided  extensions  to  existing  struc- 
tural Information  concerning  the  sensitivity  system  (Guardabassl,  et  al 
(Ref  51)  (Ref  52))  and  to  obtain  a new  and  completely  general  method  for 
computing  the  eigenvalue  sensitivities  (Section  III. 2). 

Further  geometrical  properties  of  this  matrix-operator  form  of  the 
parameter  sensitivities  were  then  studied  In  Section  IV.  Some  previously 
known  results  concerning  Insensitivity  (Theorems  5 and  6)  and  sensitivity 
controllability  (Theorem  7)  were  derived  In  a more  direct  fashion  by 
using  this  form.  Discussing  the  relationship  between  Insensitivity  and 
Identlf lability,  a systematic  way  of  designing  Identification  experiments 
which  would  reduce  the  number  of  "Influencing"  parameter  components  was 
discussed  (Corollary  5.1).  A new  result  concerning  necessary  and 
sufficient  conditions  for  zero  terminal  sensitivity  was  derived  (Corollary 
7.1),  and  the  relationship  between  insensitivity  and  sensitivity  controll- 
ability was  discussed.  Next,  new  local  and  structural  conditions  for 
zero-input  and  zero-state  Identlflablllty  were  obtalnea  (Theorems  8 and 
9).  These  conditions  were  completely  general,  were  only  dependent  upon 
the  structure  of  the  system  matrices,  and  required  no  Integrals  to  compute. 
Finally,  the  relationship  between  the  control  Input  and  steady-state 
Identlflablllty  was  discussed  and  given  new  Insight  from  the  matrix- 
operator  form  of  the  parameter  sensitivities. 

The  computational  applications  of  this  matrix-operator  form  were 
next  considered.  In  Section  V this  form  was  used  In  a fuaslllnearlzatlon 
algorithm  for  parameter  Identification  In  linear  time-invariant  systems. 
Each  Iteration  of  the  algorithm  required  that  the  new  eigenvalues  of  the 
A matrix  be  computed  and,  for  state  dimension  n and  control  dimension  r. 
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the|^  were  but  2nr  quadrature  Integrals  required  to  compute  the  complete 
sec  of  parameter  sensitivities.  Based  upon  equation  (157)  of  Section  III 
It  was  shown  how  the  eigenvalue  sensitivities  could  be  obtained  directly 
from  the  matrix  description  of  the  output  parameter  sensitivities  and  thes 
eigenvalue  sensitivities  could  then  be  used  to  help  Iteratively  compute 
the  new  eigenvalues.  Also,  based  upon  methods  presented  In  Section  IV. 3, 
linear  dependency  of  row  vectors  in  the  matrices  of  the  operator  descrip- 
tion could  be  used  to  reduce  the  number  of  Integrals  required  to  compute 
the  local  Information  matrix  and  local  gradient  function  for  each  Iteration 
of  the  quaslllnearlzacion  algorithm.  These  reduction  techniques  were 
similar  In  approach  to  methods  applied  by  Gupta  and  Mehra  (Ref  84)  for 
reducing  computations  In  differential  equation  descriptions  of  the 
parameter  sensitivity  system. 

Next  the  matrix-operator  form  of  the  parameter  sensitivities  In 
linear  time-invariant  ordinary  differential  equation  systems  was  used 
as  the  basis  of  a gradient  method  for  computing  the  optimal  open-loop 
minimum  sensitivity  control  law  (Section  VI).  This  gradient  method  of 
solution  was  developed  In  Section  II. 5.  By  using  the  matrix-operator 
description  of  the  parameter  sensitivities  obtained  In  Section  III,  the 
computational  requirements  of  this  method  proved  to  be  quite  efficient 
and  the  computations  Increased  only  linearly  with  state  dimension  p. 

This  was  In  contrast  to  previously  employed  Rlccatl  equation  methods  of 
solution  (e.g.,  Kahne  (Ref  7C))  In  which,  even  with  the  use  of  minimal 
order  sensitivity  models,  the  amount  of  computation  Increases  with  the 
square  of  n and  square  of  r. 

The  final  application  of  this  matrix  operator  form  was  In  the 
computation  of  the  optimal  Initial  condition  vector  and  optimal  control 
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input  to  improve  parameteT  Identification  from  the  zero-input  and  zero- 
atate  responses,  respectively.  The  theory  of  the  methods  was  based 
upon  results  of  Mehra  (Ref  112)  for  optimal'  input  design  from  the  zero- 
state  response,  and  so  the  main  contribution  here  was  to  show  the  computa- 
tional utility  of  the  matrix-operator  form  of  the  parameter  sensitivities. 
For  the  optimal  input  design  problem,  a Rayleigh-Ritz-Galerkin  optimiza- 
tion method  was  selected.  Using  the  matrix-operator  form  of  the  parameter 
sensitivities,  this  optimization  yielded  a highly  efficient  and  computa- 
tionally straight-forward  method  of  solution.  Additionally,  if  Walsh 
functions  were  selected  as  the  set  of  basis  functions  for  the  Raylelgh- 
Rltz-Galerkln  procedure,  then  the  computational  requirements  could  be 
reduced  still  further.  An  example  Illustrating  the  method  was  presented, 
and  the  technique  for  reducing  the  computations  with  the  use  of  Walsh 
functions  should  have  further  applications  to  system  simulation  and 
control  design  in  linear  time-invariant  ordinary  differential  equation 
systems. 

In  all  of  these  applications  to  linear  time-invariant  ordinary  dif- 
ferential equation  systems  in  Sections  III  -VII,  the  operator  methodology 
was  used  as  an  underlying  foundation.  It  is  hoped  that  the  methods  and 
results  presented  here  will  be  computationally  useful  and  will  spur  new 
researih  into  this  type  of  an  approach  for  treating  parameter  sensitiv- 
ity in  linear  dynamic  systems.  Just  a very  few  of  the  unanswered  ques- 
tions and  potential  avenues  for  further  research  are  touched  upon  in 
the  following  suosectlon. 

2.  EXTENSIONS  AND  FUTURE  RESEARCH 

Although  the  sensitivity-related  system  properties  developed  in 

Section  II  were  done  for  a very  general  Hilbert  space  linear  system, 

they  were  applied  only  to  linear  ordinary  differential  equation  systems 
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In  Sections  III  - VII.  Therefore,  It  would  be  quite  desirable  to  apply 
these  aethcds  to  linear  hereditary  and  partial  differential  equation 
systems  as  well.  (See.  for  example,  (Ref  !■).)  Of  particular  signifi- 
cance In  this  regard  might  be  the  design  of  a control  Input  to  optimize 
parameter  Identification  capability  In  such  systems;  the  Raylelgh-Rltz- 
Galerkln  algorithm  could  readily  be  applied  as  a solution  method  for 
such  systems.  The  gradient  approach  to  minimum  sensitivity  control 
design  might  also  be  easily  applied,  particularly  for  the  minimum  "termi- 
nal" sensitivity  problem.  Indeea,  In  Section  VI. 1 the  point  was  made 
that  the  minimum  terminal  sensitivity  control  problem  inherently  Involves 
no  convolution  Integrals.  This  is  particularly  Important  for  such 
general  systems  because,  like  the  case  for  a time-varying  plant  matrix, 
such  convolutions  probably  cannot  be  transformed  Into  the  far  more 
desirable  quadrature  Integral  form  as  was  done  with  the  time-invariant 
system  S^^. 

The  next  major  area  of  extension  would  be  to  consider  minimum  sensitiv- 
ity control  design  when  the  control  law  Is  closed-loop  or  possesses  feed- 
back. This  subject  has  been  treated  extensively  through  "sensitivity 
system"  differential  eqtiatlon  approaches  (see,  for  example,  (Ref  69) 

(Ref  72)  or  (Ref  75)),  and  the  operator  philosophy  and  the  algebraic 
representation  of  the  sensitivity  operators  for  linear  time-invariant 
systems  may  yield  new  approaches  and  attractive  computational  techniques. 

Ve  have  carefully  restricted  our  attention  to  "open-loop"  control  design, 
as  the  closed-loop  questions  are  rich  In  both  theoretical  and  practical 
considerations  and  to  delve  Into  them  In  any  depth  would  have  expanded 
our  scope  considerably. 

A topic  area  similar  to  closed-loop  minimum  sensitivity  control 
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design  In  that  it  is  abundant  with  theoretical  considerations  is  the 
theory  of  dual  control  first  Introduced  by  Fel'dbaum  fRef  12).  Under- 
lying the  concept  of  dual  control  is  the  fact  that  the  control  law  which 
optimizes  some  particular  control  objective  may  not  be  optimal  with 
respect  to  estimation  of  the  system  characteristics,  parameters,  or 
states.  Indeed,  this  fact  is  emphasized  by  the  conflicting  goals  of 
having  the  sensitivity  operators  "large"  for  Improved  identification 
but  "small"  for  minimum  trajectory  perturbations.  Since  the  knowledge 
of  the  system  parameters  and  state  Inherently  affects  the  ability  to 
achieve  the  control  objective,  a truly  optimal  control  must  be  one  which 
takes  both  the  control  and  estimation  objectives  into  account.  The 
optimal  formulation  of  dual  control  results  in  an  infinite  dimensional 
problem  and  even  approximate  methods  of  solution  present  severe  computa- 
tion burdens.  (See,  e.g.,  (Ref  30).)  However,  parameter  sensitivity  is 
Inherently  related  to  both  the  identification  of  parameters  and  the 
design  of  control  laws  which  are  "forgiving"  to  uncertainty  in  the  exact 
knowledge  of  the  parameter  values.  Therefore,  techniques  designed  around 
the  use  of  the  matrix-operator  representation  of  the  parameter  sensitiv- 
ities in  linear  time- invariant  systems  might  yield  adaptive  control  laws 
having  approximately  optimal  dual  control  characteristics  and  which  are 
also  easy  to  compute. 

Another  area  which  appears  to  be  quite  fruitful  for  further  research 
is  a frequency  domain  description  of  the  parameter  sensitivities  and  an 
analysis  of  their  system  properties  in  the  frequency  domain.  The  oper- 
ator time-domain  approach  for  treating  the  sensitivities  is  closely  akin 
to  a frequency  domain  method,  and  the  algebraic  representation  which  we 
have  obtained  for  the  parameter  sensitivities  of  the  time-invariant  sys- 
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tea  (Secclon  HI)  can  also  ba  derived  in  Che  frequency  doaain  through 


partial  fraction  expansion  methods.  Frequency  domain  Mthbds  have  been 
very  effective  for  the  analysis  and  design  of  feedback  control  lavs  for 
minimum  sensitivity  (e.g.,  Horowitz  14)  •&<!  Horowitz  and  Shaked 

(Sef  15)).  Additionally,  Mehra  (Ref  114)  (Ref  116)  has  used  frequency 
domain  methods  quite  beneficially  In  the  design  of  control  laws  to  opti- 
mize parameter  Identification  capability.  Further  work  relating  the  time- 
domain  and  frequency-domain  approaches  would  be  most  beneficial. 

The  final,  and  perhaps  most  Important  area  of  extension  would  be 
to  use  the  methods  and  computational  techniques  of  Sections  III  - VII  to 
solve  practical  problems  in  simulation.  Identification,  and  control  - 
particularly  In  large  scale  systems  where  the  state  dimension  n and  param- 
eter dimension  p are  large.  Really,  It  Is  here  In  such  large  scale  sys- 
tem where  the  algorithms  and  results  obtained  should  have  their  most 
beneficial  payoff.  In  Section  IV  the  properties  of  Insensitivity  and 
Identlf lability  were  related,  and  techniques  were  suggested  for  the 
design  of  parameter  identification  experiments  which  might  reduce  the 


3 


number  of  "influencing"  parameters  In  each  experiment.  Such  methods 
would  be  particularly  significant  In  large  dimension  systems  and  practical 
examples  demonstrating  the  concepts  are  most  Important  In  establishing 
the  theory.  Also,  in  each  of  the  operator  algorithms  for  Identification 
and  control  suggested  In  Sections  V - VII,  a large  number  of*  differential 
equations  by  conventional  techniques  were  replaced  by  a comparatively 
small  number  of  quadrature  Integrals.  Since  obtaining  such  practical 
algorithms  Is  a major  aim  of  this  research.  It  Is  hoped  that  these 
results  will  indeed  prove  to  be  useful  tools  for  practical  applications 
In  systems  theory. 
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Appendix  A 

COMPDXATIONAL  DETAILS  FOR  OBTAINING  PARAMETER 
SENSITIVITIES  FOR  THE  SYSTEM  S^^ 

Let  Che  reel  n x n aetrlx  A heve  the  cherecterletlc  polynoelel 

A(q)  - n (q  - qj^)*^  (AOl) 

1p-l 

where  the  elgenvelues  q^^,  q2t  •••q^  ere  reel  end  the 

end  q^2  " k • x + 1,  t + 3,  ...p-1.  Define  the  generellzcd 

Vendemonde  metrlx.  A,  with  row  entries 


d 2 2n-l, 

“T  II  <1  q ...  q 


dq-' 


q-qi 


(402) 


for  k ■ 1,  2,  •••T,  J ■ Ot  1,  ...2n|^>l  end  double  row  entries 


Reel  ^ (1  q q^ 


2n-l, 


dq-' 


q q*-  ...  q‘“  *]  ) 

Vqu 


(403) 


t~»  ti  q q*  ...  q^'\.,  ) 
dqJ  ^^k 


for  k - T + 1,  T + 3,  ...p-l»  j ■ 0,  1, 
dimensioned  vector  functions 


2n|^-l.  Next  define  Che  2n 


(404) 
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20,-1  q,t 

t • 


f(t)  S 


(405) 


1+1  - » 
e cosu^^j^t 

o ..t 

e ^ •inu^j^t 

. "t+i' 

te  co8u^_j.j^t 

°T+1^  . 

te  8lnu)^2^ 


Then  we  may  determine  the  eealar  functlonst  a^C.),  of  Theorem  4 from  the 


equation 


a(t)  - a"^  f(t) 


(406) 


However,  In  computational  appllcatlona  we  will  generally  not  use 
Theorem  4 and  the  scalar  functions  aj(*).  Rather  we  will  use  a "component" 
representation  of  the  parameter  sensitivities,  as  such  » form  allows  us  to 
directly  convert  convolution  Integrals  Into  quadrature  Integrals.  To 
this  end,  define  the  2n(p  + 1)  m-dlmenslonal  vectors 


P^°^(k)  5 I (CA^"^d)A“^(J,k) 


(407) 
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T^^^(k)  H E (CA^“^  )(^jA'^(J,k)  ( 

J-1 

and  the  2o(p  -t-  1)  m x r-dtmensloned  matrices 

J-1 

2ii 

3 E (CA^"^B)^^jA"^CJ.k) 

J-1 

1 ■ It  2f  •••Pt  k * If  2t  • • • 2o 

Note  that  the  partial  derivatives  may  be  computed  recursively  on  the 
digital  computer  by  realizing  that 

(aJ«)(1)  - A(aJ),„  + A,„  aJ 


Next  ve  define  the  matrices 


p(l)  5 [p^^)(l) 

(2) 

...  F^*^2n)]^ 

h(1)  = [H^^^(l) 

H^^\2) 

...  H^^^(2n)] 

mx2n 


1 - C 

» 1»  • 

..p 

%(or 

;(o)' 

f(1) 

h(i) 

F 3 

e 

a 

H 

e 

e 

,’(P) 

h(p) 

r 

m(p+l)x2n 
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th«  augMnted  vector 


t(t;u)  - [y^(t;u)  v^^^  (t;u) 


end  the  epeclel  product 


f^COuCt) 


f(t)*u(t)  = 


f2(t)u(t) 


f2n(t)«(t) 


t ^ 

...  (t;u)l 


2nn(l 


(415) 


(416) 


Then  In  terms  of  these  quantities  ve  may  rcvrlte  Theorem  4 In  the  compsct 


astrlx-operator  form: 


Theorem  A.l 


For  the  system  S. 


Y(t;u)  - Ff(t)  + H r f(t  - s)*u(s)ds 
0 


(417) 


This  form  Is  entirely  equivalent  to  Theorem  4 but  Is  most  convenient 

for  digital  computer  computation  because  the  elements  of  the  vector  func- 

1 

tlon  f(t)  are  the  Individual  modal  functions,  f's  (or  cosines  and  sines 
for  those  eigenvalues  which  are  complex  conjugate  conjugate  “pairs).  This 
fact  allows  us  to  directly  transform  the  convolutions  of  equation  (417) 
Into  quadratures  through  use  nf  relationships  such  as 


qi.(t-s)  q.t  -q.s  q.t  -q.s 

(t  - s)e  - te  e - e se 


(418) 


i ! 


This  is  sn  sxtrsBMly  Importsnt  fsct  snd  ws  csnnot  ovsrststs  chs  coaputs- 
tlonsl  significance  of  this  slaple  relationship. 

Troa  Theorea  A.l  ve  see  that  the  entire  sensltlrlty  systea  aay  be 
generated  with  at  aost  2nr  quadrature  Integrals.  This  nuaber  of  Integrals 
aay  be  reduced  to  a alnioua  by  ellainatlon  of  coluan  vectors  of  H which 
are  identically  sero.  In  fact,  we  are  assured  that  the  column  '^natrls" 


H(k)  = 


(k) 


(419) 


(k) 


a(p+l)xr 


will  be  identically  zero  if  k corresponds  to  s multiplicity  of  one  of  the 
eigenvalues  which  Is  greater  than  that  eigenvalue  s multiplicity  In  the 
alnlnal  polynomial  of  ' 


0 ...  0 


A(i)  A 


(420) 


^(P)  ° 


Jn(p+l)xn(p+l) 


In  fact,  letting  v be  the  order  of  the  minimal  polynomial  of  A,  we  are 
assured  at  least  2n  - p of  these  columns,  H(k),  will  be  Identically  zero 
(or  approximately  so  when  considering  numerical  errors  on  the  digital 
coaputer).  Therefore,  the  sensitivity  Y(t;u)  may  be  coaq>uted  with  only 
pr  quadratures  Instead  of  2nr  quadratures. 
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Appendix  B 

SECOND  ORDER  SENSITIVITY  OPERATORS  FOR  THE  LINEAR 
IDffi-INVARIANT  SYSTEM  Sj^ 


In  • perallelt  but  eomeiduit  abbreviated,  aianner  to  Section  III  ve  %rlll 
develop  operator  ejcpreaalona  for  the  aecond-order  aenaltlvltlea 


= 3 Y.Cti.b) 


b-b 


'(l.J) 


(t) 


(421) 


Fslng  the  aecond  order  Taylor's  series  approximation.  It  Is  seen  that  such 
second  order  sensitivity  operators  provide  a more  acccrate  prediction  of 
the  output  perturbation  caused  by  small  parameter  variations: 


y(t;b) 


y(t;b^)  + 


P 

E v'“*^^(t)Ab^  + 1/2 
1-1 


P **  (1  1) 

I I v^*^’J^(t)Ab^Abj 

1-1  J-1 


+ e^Ct)  (422) 

where  la  of  third  order  In  Ab.  Also,  In  Section  II. 6. c the  second 
order  sensitivity  operators  were  required  for  the  Newton-RaiShson  param- 
eter Identification  algorithm. 

If  it  Is  assumed  that  the  second  partial  derivatives  of  A,  B,  and 

C exist  and  are  continuous  at  b , then  the  second  order  sensitivities 

0 

may  be  obtained  from  the  "sensitivity  systas"  differential  equations 
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+ B(,^jjU(t) 


(423) 


5(l,j)(0)  . d 


(l.J) 


(424) 


(425) 


The  solution  of  these  equations  together  with  the  solution  of  the  n(p  + ^) 
differential  equations  to  obtain  x and  5^^^(t),  1 * 1,  2,  ...Pt  require* 
a total  of  at  most  n(p  + 1 + p f)  coupled  linear  differential  equations. 
"Minimal-order"  models  of  the  secona  oraer  sensitivities  have  been  ob- 
tained (Blngulac  (Kef  34)  Varshney  (Ref  40)),  but  by  and  large  there  has 
been  relatively  little  Investigation  of  the  second  order  sensitivities. 
Here  we  show  that  results  parallel  to  Section  III  can  be  used  to  compute 
the  first  and  second  order  sensitivities  with  at  most  3nr  quadrature 
integrals . 

As  in  Section  III  the  first  step  In  obtaining  an  operator  expression 
for  the  second  order  sensitivities  Is  the  determination  of  the  second 
partial  derivatives  of  the  state  transition  matrix,  j)^*^  ” ®^i  J) 

The  following  theorem  Is  the  counterpart  to  Theorem  1; 


Theorem  B.l 


Assume  A..  exists  and  Is  continuous  at  b . Then  for  each 

(l.j)  o 

t e [0,t^],  the  second  partial  derivative  i given  by  the 
Integral  equation 


J 


I 
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a26) 


or  •■  the  unique  solution  to  the  matrix  differential  equation 


(427) 

(428) 


Proof 


Let  *^^^(t)  be  the  unique  solution  of 


♦ ^j^j(t)  ■ A#^j^j(t)  + A^j^j#(t) 


.,„»)  . 0 


(429) 


and  (t)  denote  the  unique  solution  of 


*(1).  ^ 

Aj  J Aj  A,  J 


(0)  - 0 (430) 


(1) 


“J 


i 


j 
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which  has  unique  solution 


♦ <t)  - /*  - •){A,..*(8)  + (A.  - A)#,..  (■) 

0 *J 


+ (s)  - *(•))  + (A^^j  - A^^j)*^  (s)}ds  (432) 

J Aj  J 


Using  the  above  expression  and  the  definition  of  the  second  partial 


derivative 


*fl).  “ *(1)^‘^ 


♦ (i^j)(t)  H li« 


(433) 


lAbjl  - 0 


expression  (426)  of  Theorem  B.l.  readily  follows  (see  proof  of  Theorem  1). 
Equation  (427)  then  follows  as  a straight-forward  application  of  Leibnitz 
rule. 

Q.E.D. 

Like  the  essential  role  which  the  augmented  matrix  A^  plays  in  the 
definition  of  the  first  order  partial  derivatives,  ^he  augmented 

matrix 


(434) 


^(l.j)  ^(j)  ^(1)  ^ 


plays  an  essential  role  in  the  definition  of  the  second  partial  derivatives, 
e^‘  s 
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Theorem  B.2 


Using  the  relation 


and  Theorem  2 and  Theorem  B.l,  It  is  straight-forward  to  obtain  equation 


Like  Theorem  2,  Theorem  B.2  can  be  used  to  give  fundamental  algebraic 


relations  for  the  second  partial  derivatives,  e^^^  jj.  However,  before 
stating  such  representations,  the  following  general  result  concerning 


the  minimal  polynomial  of  A.,  will  be  useful.  The  notation  used  in  Sec 


tlon  III  concerning  the  minimal  polynomial  of  A will  be  maintained  here 


Suppose  that  the  y-order  minimal  polynomial  of  A is 


Then  the  minimal  polynomial  of  A.,  is 
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♦ij(q)  - n (q  - qj^) 


(438) 


and  has  order  where 


V i Wij  1 3u 


(439) 


enA  the  multiplicities  have  similar  bounds 


’"k 


(MO) 


Proof 


To  simplify  the  notation  let  D = adj  Iql  - A] : let  A be  the 
characteristic  polynomial  of  A;  and  let  the  unknown  matrix  components 
of  adj  [ql  - A^j]  be  designated 


adj[ql  - A^j]  - 


•l  *2  *3  “a 

h ^2  ^ ^ 

®1  *^2  *^3  ®A 

di  d^  d3  d^ 


(441) 


Then  It  Is  true  that  (Ref  21) 


tql  - A^j]  adj  Iql  - A^j]  - A**! 


which  can  be  used  to  form  a set  of  16  linear  equations  In  terms  of  the 
16  unknown  matrix  elements  of  adjiql  - fhen  usln&  the  relation 


[ql  - A]D  - AI 


(443) 


SIX 
:1  \ 
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It  Is  strslght-forvard,  but  algebrslcslly  soaevhst  tsdlousi  to  shov  that: 


*2  " *3  ■ *4  ■ ^3  ■ ^4  ■ ®4  ■ ® 

aj^  - b^  ■ Cj  - d^  - (445) 

bj^  - dj  - (446) 

Cj^  - dj  - A^AyjD  (447) 

dj^  - A^A^^  jjD  + ADA^jDA^^jD  + ADA^^jDA^jD.  (448) 


Mow  suppose  that  g(q)  Is  the  polynomial  such  that 


g(q)  - A(q)/\|)(q). 


(449) 


Then  g(q)  Is  the  largest  common  factor  of  every  term  of  D (Ref  21),  and 


so  the  polynomial  A(q)g'^(q)  Is  a common  factor  of  every  term  of 


adj[ql  - The  minimal  polynomial  of  A^j  is  given  by 


♦lj(q)  - d (q)/gy(q) 


(450) 


where  8^j(q)  Is  the  largest  common  factor  of  every  term  of  adj[ql  - A^^}. 


From  the  discussion  above,  A(q)g'^(q)  is  a factor  of  g^j(q),  and  so  the 


results  of  the  Lemma  follow. 

Q. E.D. 

Notice  that  like  Lemma  6,  explicit  expressions  for  the  minimal  poly- 


nomial of  A^j  could  be  provided  by  examining  common  factors  of  equations 


(443)  - (447).  Also,  if  structural  constraints  on  the  minimal  polynomial 


of  A are  assumed  in  a neighborhood  of  b^  c R^,  then  stronger  conclusions 


concerning  the  relationship  between  the  minimal  polynomial  of  A.  , and 

*•»! 


the  eigenvalue  sensitivities  can  be  made.  However,  these  stronger  results 
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ar«  notationally  cumbersome  end  have  little  apparent  practical  value 


and  so  they  are  not  Included  here 


B.l  la  now  used  in  combination  with  Theorem  B.2  to  (Ive  the 


basic  algebraic  description  of  second  partial  derivatives  of  e : 


Corollary  B.2.1 


p where  the  3n  scalar  functions  are  uniquely 


Proof 


It  is  straight-forward  to  show  that 
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Than  onlng  Thaoraa  B.Z  ic  is  easy  to  see  %ffaere  e polynoalel  representecloo 


of  order  An  could  be  obtained.  However*  ualnt 


This  le  the  central  reault  froa  which  convenient  algebraic  deacrlp- 


tlon  of  the  second  order  sensitivity  operators  can  be  obtained 


s)u(s)ds 


s)u(s)d8 


s)u(s)ds 


and  where  the  3n  scalar  functions  are  deter' 


■Ined  from  equation  (454) 


Proof 


Tho  proof  la  a atralghc-fomard  application  of  Corollary  1.2.1  to 
Cha  daflnltlon  of  tha  flrat  and  aacond  ordar  aanaltlvlty  oparatora. 

Q.K.D. 

Tha  eoovutatlonal  conaldaratlona  of  Thaoraa  B.3  ara  vary  alnllar  to 
thoaa  of  Thaoraa  4.  Motlea  that  tha  tachnlqua  of  Appendix  A can  ba  uaad 
to  convert  tha  Indicated  convolution  Intagrala  Into  a total  of  at  aoat 
3ar  quadrature  Intagrala.  Alao  recall  froa  Appendix  A that  the  nuabar 
of  quadrature  Intagrala  will  autoaatlcally  be  reduced  to  the  alnlaua 
nuaber  aa  unneceaaary  "aodea”  will  be  aultlplled  by  aero  "coaponent" 
aatrleea  In  the  coaponent  repreaentatlon  of  e^  and  ao  theae 
unnecaaaary  aodea  can  be  ellalnated.  Finally,  we  coaeient  that  tech- 
nlquea  alallar  to  thoae  dlacuaaed  In  Section  IV. 3 nay  be  applied  here 
to  yield  a alnlaal  dlaenalon  differential  equation  deacrlptlon  of  the 
aacond  order  aanaltlvlty  ayatem.  Such  an  approach  would  be  an  alterna- 
tive to  alnlaal-order  aanaltlvlty  aodela  of  Blngulac  (Ref  34)  and 
Varahney  (Ref  40),  and  would  yield  the  abaolute  minimum  number  of 
differential  equatlona. 


AFAL-TR-76-118 


BZBLZOGIAPBT 


for  tho  convonloneo  of  tho  roador  v«  havo  dlvldad  tho  Bibliography 
into  Mjor  catagorlaa  of  rafarancaa.  Slnca  Cha  dlaaarcatloo  covara  a 
falrly  broad  ranga  of  araaa  la  ayataas  chaory.  It  la  bopad  ehac  this 
organisation  will  assist  tha  raadar  in  locating  tha  rafarancaa  for 
which  ha  la  particularly  Intaraatad.  This  Bibliography  la  by  no  aaana 
axhaustlva  but  aaraly  llata  cha  rafarancaa  which  ara  particularly  rala- 
want  or  llluatratlva  to  tha  points  nada  In  tha  body  of  cha  dlsaartatlon. 
Thaaa  catagorlaa  ara  tha  following:  A.  Ganaral  Rafarancaa,  B.  MlnlaMl 
Ordar  Hodals  for  Sansltlvlty,  C.  Elganvalua  Sansltlvlty,  D.  Znsanslclw- 
Ity  and  Invarlanca,  E.  Mlnlnnm  Sansltlvlty  Control  Latra,  F.  Ganaral 
Rafarancaa  on  Paranetar  Identification,  G.  Idantlflablllty,  H.  Input 
Design  for  Improved  Parameter  Identification,  and  I.  Computation  with 
Valah  Functions. 


A.  Ganaral  Rafarancaa 

1.  Baras,  J.  S.,  R.  W.  Brockatt,  and  P.  A.  Fuhrmann.  "State  Space 

Models  of  Infinite  Dimensional  Systems",  IEEE  Trans.  Auto.  Control. 
AC-19:  693-700,  Dec  1974. 

2.  Bellman,  R.  Introduction  to  Matrix  Analysis.  New  York:  McGraw  R:^ll, 
2nd  Edition,  1970. 

3.  Blum,  E.  K.  Numerical  Analysis  and  Computation  Theory  and  Practices. 
Reading,  Hass:  Addlson-Wesley,  1972. 

4.  Bode,  H.  U.  Network  Analysis  and  Feedback  Amplifier  Design.  New 
York:  Van  NosCrand,  1945. 

5.  Bryson,  A.  E.,  and  Y.  C.  Ho.  Applied  Optimal  Control  Theory.  Waltham, 
Mass:  Ginn  Company,  1966. 

6.  Cruz,  J.  B.,  Jr.,  Editor.  System  Sensitivity  Analysis.  Stroudsburg, 

Pa:  Dowden  Hutchinson  Ross,  1973. 

7.  Cruz,  J.  B.,  Jr.,  and  W.  R.  Perkins.  "A  New  Approach  to  Che  Sensitiv- 

ity Problem  in  Multivariable  Feedback  System  Design",  IEEE  Trans.  Auto. 
Control.  AC-19:  216-223,  1964. 


APAL-TR-76-118 


C««kl,  F.  G.  "SoM  Notes  on  the  Inversion  of  Confluent  Vsnderaonde 
)Utrlces'\  IEEE  Trsns.  Auto.  Control.  AC-20:  154-157,  Feb  1975. 

Delfour,  M.  C.  end  S.  K.  Hitter.  "Controllsblllty,  Obser\eblllty 
end  Optimal  Feedback  Control  of  Affine  Hereditary  Differential  Sys- 
tems", SIAM  J.  Control.  10,  2:  298-328,  May  1972. 

Delfour,  M.  C.,  and  S.  K.  Hitter.  "Controllability  and  Observability 
for  Infinite  Dimensional  Systems",  SIAM  J.  Control.  10,  2:  329-333, 

May  1972. 

Faddeev,  0.  K.,  and  V.  N.  Faddeeva.  Computational  Methods  of  Linear 
Algebra,  trans  by  R.  C.  Willlama,  San  Francisco:  W.  H.  Freeman 
Pub.,  1963. 

Fel'dbaum,  A.  A.  Optimal  Control  Systems.  New  York:  Acaddklc  Press, 
1965. 

Goknar,  I.  C.  "Obtaining  the  Inverse  of  the  Generalized  Vandermonde 
Matrix  of  the  Most  General  Type",  IEEE  Trans.  Auto.  Control.  AC-18: 
530-532,  Oct  1973. 

Horowitz,  I.  M.  Synthesis  of  Feedback  Systests.  New  York:  Academic 
Press,  1963. 

Horowitz,  I.  M.,  and  0.  Shaked.  "Superiority  of  Transfer  Function 
Over  State-Variable  Methods  in  Linear  Time-Invariant  Feedback  System 
Design",  IEEE  Trans.  Auto.  Control.  AC-20:  84-97,  Feb  1975. 

Kalman,  R.  E.  "Mathematical  Description  of  Linear  Dynamical  Systems", 
SIAM  J.  Control.  1:  152-192,  1963. 

Kalman,  R.  E.,  Y.  C.  Ho,  and  K.  S.  Narendra.  "Controllability  of 
Linear  Dynamical  Systems",  Contributions  to  Differential  Eqtiatlons. 

1,  2:  189-213,  1963. 

Lee,  R.  C.  K.  Optimal  Estimation.  Identification,  and  Control. 

M.  I.  T.  Press,  1964. 

Llustemlk,  L.  A.,  and  W.  I.  Sobolev.  Elements  of  Functional  Analysis. 
New  York:  Frederick  Ungar  Pub.  Co.,  1961. 

Luenberger,  D.  G.  Optimization  by  Vector  Space  Methods.- New  York: 

John  Wiley,  1969. 

Nerlng,  E.  D.  Linear  Algebra  and  Matrix  Theory.  New  York:  John 
Wiley,  1970. 

Pontryagln,  L.  S.,  V.  G.  Boltyanskll,  R.  V.  Gamkrelidze,  and  E.  F. 
Mishchenko.  The  Mathematical  Theory  of  Optimal  Processes.  (Trans  by 
K.  N.  Trlrogoff,  ed.  by  L.  W.  Neustadt),  New  York:  John  Wiley,  1962. 


23.  Porter,  W.  A.  "Minimizing  System  Sensitivity  Through  Feedback",  J. 
Franklin  Institute,  286,  3:  225-229. 


187 


J 


AFAL-TR-76-U8 


24.  Rosen,  J.  "The  Credlenc  Projection  Method  for  Nonlinear  Prograsailns. 

Part  I.  Linear  Constraints",  J.  Soc.  Indus.  Appl.  Math..  8;  181-217, 

1960. 

25.  Royden,  H.  L.  Real  Analysis.  New  York:  The  MacMillan  Co.,  2nd  Edi- 
tion, 1968. 

26.  Silverman,  L.  M.,  and  H.  E.  Meadows.  "Controllability  and  Time- 
Variable  Unilateral  Networks",  IEEE  Trans.  Circuit  Theory.  CT-12: 
308-312,  Sep  1965. 

27.  Taylor,  A.  E.  Introduction  to  Functional  Analysis.  New  York:  John 
tflley,  1958. 

28.  Tomovlc,  R.  Sensitivity  Analysis  of  Dynamic  Systems.  New  l^rk: 

McGraw  Hill,  1963. 

29.  Tomovlc,  R.,  and  M.  Vukobratovlc.  Cmneral  Sensitivity  Theory.  New 
York:  American  Elsevier,  1972. 

30  Tse,  E.,and  Y.  Bar-Shalom.  "An  Actively  Adaptive  Control  for  Linear 
Systems  with  Random  Parameters  via  the  Dual  Control  Approach",  IEEE 
Trans.  Auto.  Control.  AC-18:  109-116,  Apr  1973. 

31.  Van  Trees,  H.  L.  Detection,  Estimation,  and  Modulation  Theory.  Part  I, 
New  York:  John  Vlley,  l968. 

32.  Weiss,  R.  "Use  of  Semi-Croups  for  Derivation  of  Controllability  and 
Observability  In  Time- Invariant  Linear  Systems”,  Int.  J.  Control. 

18,  3:  475-479,  1^73. 

33.  Zadeh,  L.  A.,  and  C.  A.  Desoer.  Linear  System  Theory.  New  York: 

McGraw  Hill,  1963. 


B.  Minimal  Order  Models  for  Parameter  Sensitivity 

34.  Blngulac,  S.  P.  "Simultaneous  Generation  of  the  Second  Order  Sensitiv- 
ity Functions",  IEEE  Trans.  Auto.  Control.  AC-11:  563-566,  Jul  1966. 

35.  Denary,  D.  G.  "Simplification  In  the  Computation  of  the  Sensitivity 
Functions  for  Constant  Coefficient  Line- r Systems",  IEEE.  Trans.  Auto. 
Control.  AC-16,  3.  Aug  1971. 

36.  Guardabassl,  C. , A.  Locatelll,  and  S.  Rinaldi.  "On  the  Optimality 
of  the  Wllkle-Ferklns  Low-Order  Sensitivity  Model", >IEEE  Trans.  Auto. 

’ Control.  AC-15:  382-384,  Jun  1970. 

37.  Rokotovlc,  P.  V.  "Methods  of  Sensitivity  Points  In  the  Investigation 
of  Linear  Control  Systems”,  Automatika  i’  Telemekhanika.  25,  12: 
1670-1676,  Dec  1964. 

38.  Neumann,  C.  P.,  and  A.  K.  Sood.  "Sensitivity  Functions  for  Multi- 

Input  Linear  Time  Invariant  Systems",  Int.  J.  Control.  13,  6:  1137- 

1150,  1971. 


188 


APAL-TR-76-188 


39.  HeuMnn,  C,  P.,  mad  A.  K.  Sotxl.  "Senate Ivlty  Punctlona  for  Hultl- 
Input  Linear  Tine- Invariant  Syatana  - II.  Mlnlawl  Order  Models", 

Int.  J.  Control.  15,  3:  451-463,  1972. 

40.  Varshney,  R.  K.  "teduced-Ordcr  Sensitivity  Models  of  Linear  Sys- 
tens  with  Applications",  Ph.  D.  Dissertation,  University  of  Illinois, 
Urbana,  1972. 

41.  Vllkle,  D.  P.  and  W.  R.  Perklna.  "Essential  Paraaeters  In  Sensitiv- 
ity Analysis”,  Autonatlca.  5:  191-197,  1969. 

42.  Vllkle,  D.  F.  and  V.  R.  Perklna.  "Generation  of  Sensitivity  Func- 

tions for  Linear  Systeas  Using  Low-Order  Models",  IEEE  Trans.  Auto. 
Control.  AC-14:  123-130,  1969. 


C.  glgenvalue  Sensitivity 

43.  Crossley,  T.  R.  srd  E.  Porter.  "Eigenvalue  and  Eigenvector  Sensitiv- 
ities In  Linear  Systent-  Theory",  Int.  J.  Control.  10,  2:  163-170, 

1969. 

44.  Crossley,  T.  R.  and  B.  Porter.  "Hlgh-Order  Elgenproblea  Sensitiv- 

ity Methods:  Theory  and  Application  to  the  Design  of  Linear  Dynaalcal 
Systems",  Int.  J.  Control.  10,  3:  315-329,  1969. 

45.  Mantey,  P.  E.  "Eigenvalue  Sensitivity  and  State-Variable  Selection", 

IEEE  Trans.  Auto.  Control.  AC-13:  263-269,  Jun  1968. 

» 

46.  Morgan,  B.  S.,  Jr.  "Sensitivity  Analysis  and  Synthesis  of  Multi- 

variable  Systems",  IEEE  Trans.  Auto.  Control.  AC-11:  506-512,  1966. 

47.  Singer,  R.  A.  "Selecting  State  Variables  to  Minimize  Eigenvalue 

Sensitivity  of  Multivariable  SystesM",  Autonatlca.  5:  85-93,  1969. 


D.  Sensitivity  Controllability  and  Uncontrollablllty 

48.  Bertele,  U.,  and  G.  Gtiardabassl.  "Can  the  Terminal  Condition  of 

Time-Invariant  Linear  Control  Systems  be  Made  Immune  Against  Small 
Parameter  Variations?",  IEEE  Trans.  Auto.  Control,  AC-16:  460- 

462,  Oct  1971. 

49.  Guardabassl,  G.,  A.  Locatelll,  and  S.  Rinaldi.  "Structural  Uncon- 
trollablllty of  Sensitivity  Systems",  Automatlca.  5:  297-301, 

May  1969. 

50.  Guardabassl,  G.,  A.  Locatelll,  and  S.  Rinaldi.  "Further  Results 
On  Controllability  and  Sensitivity",  IEEE  Trans.  Auto.  Control. 

AC-15:  697-698,  Dec  1970. 

51.  Gudrdabassi,  G.,  A.  Locatelll,  and  S.  Rinaldi.  "Structural  Proper- 
ties of  Sensitivity  Systems",  J.  Franklin  Institute,  294,  4:  241- 

248,  Oct  1972. 

y 


189 


APAL-TR-76-118 


OuardcbMslt  G. , A.  Loc«t*lllt  and  S>  Rinaldi.  **111#  Rola  of  Con- 
trollability in  Soaa  Senaitivity  Problcaa”,  Int.  J.  Control.  19 t 1 
57-64,  1974. 


Sarvaragi,  T. , J.  Inoua,  and  K.  Aaai.  "Synthaala  of  Opan-loop 
Optimal  Control  with  Zero  Senaitivity  Terminal  Conatralnta”, 
Automatica.  5:  389-394,  Hay  1969. 


E.  Inaenaitivity  and  Invariance 


Aggarval,  J.  K. , H.  T.  Banka,  and  N.  H.  McClamroch 
in  Linear  Syatema",  J.  Hath.  Anal,  and  AppI..  29: 


55.  Bonlvento,  C.  "Structural  Insensitivity  Versus  Identlfiataility' 
IEEE  Trans.  Auto.  Control.  AC-18;  190-192,  Apr  1973. 


Bonlvento,  C. , R.  Culdorzl,  and  6.  Harro.  "Parameter  Insensitiv- 
ity and  Controlled  Invariance",  Third  IPAC  Symposium  on  Sensitivif 
Adaptivity  and  Optimality.  Ischia,  Italy,  Jun  18-21,  1973. 


Cruz,  J.  B.  and  W.  R.  Perkins.  "Conditions  for  Signal  and  Param- 
eter Invariance  in  Dynamical  Systems",  IEEE  Trans.  Auto.  Control. 
AC-11:  614-615,  Jul  1966. 


Fabian,  E.  and  W.  H.  Uonham.  "Decoupling  and  Data  Sensitivity' 
IFBR  Trans.  Auto.  Control.  AC-20:  338-344,  Jun  1975. 


Fronza,  G.  and  A.,  Locatelll.  "Insensitivity  by  Linear  Feedback' 
J.  Franklin  Institute.  296,  4:  237-247,  Oct  1973. 


60.  Karlin,  A.,  A.  Locatelll,  and  C.  Zanardlnl.  "Trajectory  Insensitiv- 
ity via  Feedback",  Automat ica.  10:  517-524,  1974. 


Gtiardabassi,  G. , A.  Locatelll,  and  S.  Rinaldi.  "Insensitivity  and 
Parameterlc  Invariance  in  Linear  Systems",  Int.  Rep  LCA  71-9, 
Institute  di  Elletrotecnica  ed  Elettronlca,  Politecnico  dl  Milano, 
Apr  1971. 


62.  Guardabassl,  G.,  A.  Locatelll,  and  S.  Rinaldi.  "Parameterlc 
Invariance  of  Linear  Systems",  Richerche  di  Automatica.  3,  1 
Apr  1972. 


63.  Guardabassl,  G.,  A.  Locatelll,  and  S.  Rinaldi.  "Zero-Sensitivity 
in  Linear  Systems",  Int.  J.  Control.  15,  4:  785-791,  1972. 


Guardabassl,  G. , A.  Locatelll,  and  S.  Rinaldi.  "Signal  Insensitiv- 
ity, Controllability,  and  Observability  in  Linear  Systems",  IEEE 
Trans.  Auto.  Control.  AC-16;  277-278,  Jun  1970. 


Karlin,  A.,  A.  Locatelll,  and  C.  Zanardini.  "Trajectory  Insensitiv- 
ity via  Feedback",  Automatica.  10:  517-524,  1974. 


APAL-TR-76-118 


66.  Khruscalev,  M.  M.  "Necessary  and  Sufficient  Conditions  for  Weak 

Invariance",  Autoaatlka  1 Telemekhanlka.  4:  17-22,  Apr  1968. 

67.  Rozoner,  L.  I.  '"A  Variational  Approach  to  the  Problem  of  Invariance 
of  Automatic  Control  Systems",  Automat Ika  I Telemekhanlka.  24,  6: 
744-756,  Jun  1963. 


P.  Minimum  Sensitivity  Control  Laws 

68.  Guardabassl,  G.,  A.  Locatelll^ and  S.  Rinaldi.  "On  the  Optimization 
of  Continuous  Linear  Systems  with  Sensitivity  Constraints",  Pre- 
prints Second  IPAC  Symposium  on  System  Sensitivity  and  Adaptivity. 
Dubrovnik,  Yugslavla,  Aug  1968. 

69.  Holtzman,  J.  M.,  and  S.  Boring.  "The  Sensitivity  of  Terminal  Condi- 

tions of  Optimal  Control  Systems  to  Parameter  Variations",  IEEE 
Trans.  Auto.  Control.  AC-10:  420-426.  Oct  1965. 

70.  Kahne,  S.  "Lov-Sensltlvlty  Design  of  Optimal  Linear  Control  Sys- 
tems", IEEE  Trans.  Aerospace  and  Elect.  Systems.  AES-4:  374-379, 

May  1968. 

71.  Rwakemaak,  H.  "Optimal  Low-Sensltlvlty  Linear  Peedback  Systems", 
Automatlca.  5:  279-285,  1969. 

72.  Lament,  G.^and  S.  Kahne.  "Comparison  of  Sensitivity  Improvement 
Techniques  for  Linear  Optimal  Control  Systems",  IEEE.  Trans.  Aero- 
space and  Elect.  Systems.  AES-5:  142-151,  Mar  1969. 

73.  Luh,  J.  "Minimum  Trajectory  Sensitivity  of  a Large  Launch  Booster 
Control  System",  IEEE  Trans.  Aerospace  and  Elect.  Systems.  AES-5: 
213-220,  Mar  1969. 

74.  Pederson,  K.  C.,and  L.  R.  Nardlzzl.  "Adaptive  Control  Using  Opti- 
mally Sensitive  Design  Techniques",  Int.  J.  Control.  17,  2:  435- 

447,  1973. 

75.  Price,  C.,  and  J.  Deyst.  "A  Method  for  Obtaining  Declred  Sensitiv- 
ity Characteristics  with  Optimal  Controls",  Proceedings  Joint  Auto. 
Control  Conf..  University  of  Michigan,  1968. 

76.  Rohrer,  R.^ and  M.  Sobral,  Jr.  "Sensitivity  Considerations  In  Optimal 

System  Design",  IEEE  Trans.  Auto.  Control.  AC-10:  43-48,  Jan  1965. 

77.  Stavroulakls,  P.,  and  P.  Sarachlk.  "Low  Sens It IvltjT  Feedback  Gains 

for  Deterministic  and  Stochastic  Control  Systems",  Int.  J.  Control. 
19,  1:  15-31,  1974. 

78.  Zadlcarlo,  J.,ar.(l  R.  Swan.  "The  Optimal  Control  of  Linear  Systems 

Trtth  Unknown  Parameters",  IEEE  Trans.  Auto.  Control.  AC-11:  423- 

426,  Jul  1966.  - 


191 


APAL-TR-76-118 

6.  General  References  on  Parameter  Identification 

79.  Allison,  J.  S.  "On  the  Comparison  of  Two  Methods  of  Off-Line  Param- 
eter Identification",  J.  Math.  Anal,  and  Appl..  18:  229-237,  1967. 

• 

80.  Astrom,  K.  J.,  and  P.  Eykhoff.  "System  Identification  - A Survey", 

Automatlca.  7:  123-162.  1971. 

81.  Banks,  H.  T.,  and  G.  M.  Groome.  "Convergence  Theorems  for  Parameter 
Estimation  by  Quaslllnearlzatlon",  J.  Math.  Anal,  and  Appl..  42: 
91-109,  1969. 

82.  Cuenod,  M.,  and  A.  P.  Sage.  "Comparison  of  Some  Methods  Used  for 

Process  Identification",  Automatlca.  4:  235-269,  1968. 

83.  Eykhoff,  P.  "Process  Parameter  and  State  Estimation",  Automatlca. 

4:  205-233,  1968. 

84.  Gupta,  N.  K.,and  R.  K.  Mehra.  "Computational  Aspects  of  Maximum 

Likelihood  Estimation  and  Reduction  In  Sensitivity  Function  Calcula- 
tions", IEEE  Trans.  Auto.  Control.  AC-19:  774-783,  Dec  1974. 

85.  Kltzerc,  R.  A.  "System  Level  IMU  Calibration",  Proceedings  of 
Symposium  on  A.  F.  Appl.  of  Modern  Control  Theory.  Wrlght-Patterson 
AFB,  Ohio,  Jul  1974. 

86.  Kumar,  K.  S.  P.^and  R.  Srldhar.  "On  the  Identification  of  Control 

Systems  by  the  Quasl-Llnearlzatlon  Method",  IEEE  Trans.  Auto.  Control. 
AC-9:  151-154,  Apr  1964. 

87.  Sarldls,  G.  N.  "Stochastic  Identification  Methods  for  Identifica- 
tion and  Control  - A Survey",  IEEE  Trans.  Auto.  Control.  AC-19: 
798-809,  Dec  1974. 


H.  Identlf lability 

88.  Aokl,  M.,  and  P.  C.  Tue.  "On  Certain  Convergence  Questions  In 

System  Identification",  SIAM  J.  Control.  8,  2:  239-256. 

• 

89.  Bellman,  R., and  K.  J.  Astrom.  "On  Structural  Identlf lability". 

Mathematical  Blosclences.  7:  313-328,  1970. 

90.  Denham,  M.  J.  "Canonical  Forms  for  the  Identification  of  Multl- 

Varlabie  Linear  Systems",  IEEE  Trans.  Auto.  Control.  AC-19:  646- 

656,  Dec  1974. 

91.  Glover,  R.  "Structural  Aspects  of  System  Identification",  Ph.D. 
Dissertation,  M.I.T.,  Aug  1973. 

92.  Glover,  K.  "Some  Geometrical  Properties  of  Linear  Systems  with 
Implications  In  Identification",  Submitted  to  IFAC  Congress,  Boston, 
Aug  1975. 


192 


AFAL-TR-76-118 


Glover,  K.^and  J.  C.  Wlllcma.  "PartuneterlzaCions  of  Linear  Dynamical 
Syeteme:  Canonical  Forms  and  Identlf lability",  IEEE  Trans.  Auto. 
Control.  AC-19:  640-645,  Dec  1974. 

Hoberock,  L.  L.  and  G.  U.  Stewart.  "Input  Requirements  and  Param- 
etric Errors  for  System  Identification  Under  Periodic  Excitation", 
Trans,  of  the  ASME.  296,  Dec  1972. 

Kim,  C.  H.,  and  D.  P.  Llndorff.  "Input  Frequency  Requirements  for 
Identification  Through  Liapunov  Methods",  Int.  J.  Control.  20,  1: 
35-48,  1974. 

Martenson,  K.  "Least  Squares Identlf lability  of  Dynamic  Systems", 
Tech.  l^t.  RB73-44,  Dept.  Elec.  Eng.,  USC,  Los  Angeles,  CA,  Aug  1973. 

Msyne,  D.  Q.  "A  Canonical  Model  for  Identification  of  Multivariable 
Linear  Systems",  IEEE  Trans.  Auto.  Control.  AC-17:  728-731,  Oct 

1972. 

Mehra,  R.  R.  "Identification  In  Control  and  Econometrics:  Similar- 
ities and  Differences",  Tech.  Rpt.  No.  647,  Dlv.  Eng.  and  Applied 
Physics,  Harvard  Unlv. , Cambridge,  MA,  Jul  1973. 

Payne,  R.  L.,  and  G.  C.  Goodwin.  "On  the  Identlflablllty  of  Linear 
Dynamic  Systems",  Int.  J.  Control,  20,  5:  865-868,  1974. 

Popov,  V.  M.  "Invariant  Description  of  Linear  Time-Invariant 
Controllable  Systras",  SIAM  J.  Control.  10,  2:  252-264,  1974. 

Staley,  R.  M.,  and  P.  C.  Yue.  "On  System  Parameter  Identlflablllty", 
Information  Sciences.  2:  127-138,  1970. 

Tse,  E.  "Information  Matrix  and  Local  Identlflablllty  of  Param- 
eters", Proceedings  Joint  Auto.  Control  Conf..  Ohio  State  Unlv., 
Columbus,  OH,  1973. 

Tse,  E.,  and  J.  J.  Anton.  "On  the  Identlflablllty  of  Parameters", 

IEEE  Trans.  Auto.  Control.  AC-17:  637-646,  Oct  1972. 


I.  Input  Design  for  Improved  Parameter  Identification 

104.  Aokl,  M.,  and  R.  M.  Staley.  "On  Input  Signal  Synthesis  In  Parameter 

Identification",  Automatlca.  6:  431-440,  1970. 

105.  Barker,  H.  A.,  and  D.  Raeslde.  "Linear  Modeling  of  Multivariable 
Systems  with  Pseudo-Random  Binary  Input  Slgiuls",  Automatlca.  4: 
393-416,  1968. 

106.  Funahashl,  Y.,  and  K.  Nakamura.  "Identification  of  Discrete-Time 
Systems  Using  a Composite  Sinusoidal  System",  Int.  J.  Control. 

18,  5:  945-962,  1973. 


AFAL-TR-76-118 


107.  Goodwin,  G.  C.,  J.  C.  Murdoch,  and  R.  L.  Payne.  "Optimal  Test  Signal 
Design  for  Linear  S.I.S.O.  System  Identification",  Int.  J.  Control. 

17,  3;  45-55,  1973. 

108.  Gupta,  M.  K.>and  V.  E.  Hall.  "Input  Design  for  Identification  of 
Aircraft  Stability  and  Control  Derivatives",  HASA-CR-2493,  Feb  1975. 

109.  Kalaba,  R.  E.,  and  K.  Splngam.  "Optimal  Inputs  and  Sensitivities 
for  Parameter  Estimation",  J.  Opt.  Theory  and  Appl..  11,  Is  56-67, 
1973. 

110.  Kalaba,  R.  E.,aiid  K.  Splngam.  "Optimal  Input  System  Identification 

for  Homogeneous  and  Non-Homogeneous  Boundary  Conditions",  J.  Opt. 
Theory  and  Appl. . 16,  5:  487-496,  Sep  1975. 

111.  Levadl,  V.  S.  "Design  of  Input  Signals  for  Parameter  Estimation", 

IEEE  Trans.  Auto.  Control,  AC-11:  205-211,  Apr  1966. 

112.  Mehra,  R.  K.  "Optimal  Inputs  for  Linear  System  Identification", 

Preprints,  1972  Joint  Auto.  Control  Conf..  Stanford,  CA,  Aug  1972; 
also  IEEE  Trans.  Auto.  Control,  AC-19:  192-200,  Jun  1974. 

113.  Mehra,  R.  K.  "Frequency-Domain  S)mthesls  of  Optimal  Inputs  for  Linear 
System  Parameter  Estimation",  TR645,  Division  of  Engineering  and 
Applied  Physics,  Harvard  Unlv. , Jul  1973. 

114.  Mehra,  R.  K.  "Optimal  Input  Signals  for  Parameter  Estimation  In 

Dynamic  Systems  - Survey  and  New  Results",  IEEE  Trans.  Auto.  Control. 
AC-19:  753-768,  Dec  1974. 

115.  Mehra,  R.  K.  "Time-Domain  Synthesis  of  Optimal  Inputs  for  System 
Identification",  Preprints  IEEE  Decision  and  Control  Conf..  Phoenix, 
AZ,  Dec  1974. 

116.  Mehra,  R.  K.  "Frequency-Domain  Synthesis  of  Optimal  Inputs  for 
Multllnput-Multloutput  (MIMO)  Synthesis  with  Process  Noise",  Pre- 
prints IEEE  Decision  and  Control  Conf..  Phoenix,  AZ,  Dec  1974. 

117.  Nahl,  N,  E.,  and  G.  A.  Napjus.  "Design  of  Optimal  Probing  Signals 
for  Vector  Parameter  Estimation",  Preprints  IEEE  Decision  and  Control 
Conf . . Miami,  FL,  Dec  1971. 

118.  Nahl,  N.  E.,  and  D.  E.  Wallis,  Jr.  "Optimal  Inputs  for  Parameter 
Estimation  in  Dynamic  Systems  with  White  Observation  Noise",  Pre- 
prints. Joint  Auto.  Control  Conf..  Boulder,  CO,  Au^  1969. 

119.  Reid,  D.  B.  "Optimal  Inputs  for  System  Identification",  Ph.D. 
Dissertation,  Stanford  Unlv.,  Stanford,  CA,  SUDAAR  440,  May  1972. 

120.  Stepner,  D.  E.,  and  R.  R.  Mehra.  "Maximum  Likelihood  Identification 
and  Optimal  Input  Design  for  Identifying  Aircraft  Stability  and 
Control  Derivatives",  NASA  Contractor  Rpt.,  NASA  CR-2200,  Mar  1973. 


194 


AFAL-TR-76-118 


121.  Swanson,  G.  D.,  and  J.  W.  Bellvllle.  "Forcing  Function  Selection  for 
Identification  In  Human  Respiratory  Control",  Preprints.  IEEE  Deci- 
sion and  Control  Conf..  San  Diego,  CA,  Dec  1973. 

122.  Zarrop,  M.  B.,  and  G.  C.  Goodwin.  "Comments  on  Optimal  Inputs  for 

System  Identification",  IEEE  Trans.  Auto.  Control.  AC-20:  299-300, 

Apr  1975. 


J.  Computation  with  Walsh  Functions 

123.  Chen,  C.  F.^  and  C.  H.  Hsiao.  "Walsh  Series  Analysis  in  Optimal 

Control",  Int.  J.  Control.  21,  6:  881-898,  Jun  1975. 

124.  Chen,  C.  F.,  and  C.  H.  Hsiao.  "Design  of  Piecewise  Constant  Gains  for 

Optimal  Control  via  Walsh  Functions",  IEEE  Trans.  Auto.  Control. 
AC-20,  5:  596-602,  Oct  1975. 

125.  Corrlngton,  M.  S.  "Solution  of  Differential  and  Integral  Equations 
with  Walsh  Functions",  IEEE  Trans,  on  Circuit  Theory.  CT-20,  5: 
470-476,  Sep  1973. 

126.  Walsh,  J.  L.  "A  Closed  Set  of  Orthogonal  Functions",  American  J. 

Math..  45:  5-24,  1923. 


195 

☆ u.  ‘i.  i/)VfHWMrNr  orrici  ; — 7'j7-ooi/8.‘ 


